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Field of the Invention 

This invention relates in general to novel 
biosynthetic compositions of matter having particular 
utility as in vivo targeting agents and more 
specifically, to biosynthetic dimeric constructs of 
single-chain binding proteins (sFv), conjugates 
thereof, and to methods for their production. 

Background of the Invention 

The development of murine monoclonal antibodies 
and their proteolytic Fab fragments has raised interest 
in their utility as diagnostic and therapeutic reagents 
for in vivo imaging and drug targeting. However, 
successful in vivo targeting of radionuclides, drugs or 
toxins using 150 kD intact antibodies or their 50 kD 



Fab fragments (an antibody fragment consisting of one 
light chain and approximately half of the heavy chain 
held together by a single disulfide bond) have been 
restricted by the limited penetration of these 
molecules from the vasculature into the tissues of 
interest, and by their slow clearance rates in vivo, 
which for IgG leads to behavior that requires several 
days to clear the background enough for imaging to be 
possible. Other disadvantages of the intact antibodies 
or their Fab fragments include: their immunogenic ity 
when prepared from different species, their non- 
specific binding to many normal tissues and organs, and 
the fact that they contain multiple proteolytic 
cleavage sites which result in their degradation during 
their circulation in vivo . 

Although Fv fragments, which consist of one V R 
and one V L domain held together by noncovalent 
interactions, form the minimal region of an antibody 
that contains a complete antigen combining site, 
dissociation of the V H and V L domains in vivo can 
preclude their use as therapeutic or imaging agents. 
Although Moore et al . , (US 4,642,334) and Glockshuber 
et al., (1990, Biochem. 29, 1362-1367) disclose 
attempts to stabilize these Fv fragments with , 
engineered intermolecular disulfide bonds, monovalent 
50kD Fab and Fab' fragments have, until recently, been 
the smallest proteins available for effective 
immunotargeting . 

Recently, single-chain Fv (sFv) polypeptide 
chains of about 27 kD have been developed containing 
covalently linked V R -V L polypeptides. The V H ~ and V L ~ 
domains are connected by a polypeptide linker. The 
resulting sFv polypeptide chains are also referred to 
in the art as biosynthetic antibody binding sites or 
BABS and preferably are encoded by a single DNA 
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sequence. For a detailed description of these 
biosynthetic polypeptide chains see for example, Huston 
et al. , 1988, Proc. Nat. Aca. Sci. USA 85: 5879-5883 or 
US Patents 5,091,513 and 5,132,405, all of which are 
hereby incorporated by reference. The sFv polypeptide 
chains provide attractive alternatives to intact 
immunoglobulins and Fab fragments due to their small 
size and their stability at concentrations that 
typically promote dissociation of natural Fv fragments. 
US Patents 5,091,513 and 5,132,405; Huston et al. , 
((1991) Methods in Enzymology 203: 46-88; Huston et al 
□ (1993) Int. Rev. Immunol . 10: 195-217) disclose the 

;B utility of sFv polypeptides, as well as single chain 

jjj constructs synthesized from single DNA sequences, which 

m may further comprise ancillary effector proteins, such 

~jL as a second sFv or a cytotoxic agent, 

u Pack et al. ((1992) Biochem 31; 1579-1584) 

disclose the construction of "mini-antibodies". The 
! ™i mini-antibodies are sFv polypeptide chains which also 

ifj include an "oligomerization domain" at their C-termini, 

0} separated from the sFv by a hinge region. The 

;~f oligomerization domains comprise self-associating a- 

helices, for example, leucine zippers, that can be 
further stabilized by additional disulfide bonds. The 
domains are designed to be compatible with vectorial 
folding across a membrane, a process thought to 
facilitate in vivo folding of the polypeptide into a 
functional binding protein. 

PCT application PCT/US92/09965 , published June 
10, 1993 also discloses the construction of bivalent 
sFv constructs, including crosslinked dimers. 
However, the pharmacokinetic properties of these 
constructs or those disclosed by Pack et al. are not 
measured in vivo. 
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PGT application PCT/US92/07986 , published 
April 1, 1993 discloses bifunctional (Fab') 2 molecules 
composed of two Fab' monomers linked through cysteine 
amino acids located at. the C-terminus of the first 
constant domain of each heavy chain. PCT application 
PCT/US92/10140, published June 10, 1993 also discloses 
bifunctional (Fab') 2 dimers which, in addition to the 
cysteine residues located in the hinge region, also 
contain C- terminal leucine zipper domains that further 
stabilize the (Fab') 2 dimers. In both cases, the 
resulting (Fab') 2 dimers (>100kD in size), although 
smaller than intact immunoglobulins, are significantly 
larger than sFv polypeptides and are anticipated to 
have slower tissue biodistribution and clearance rates 
following in vivo administration. 

Cumber et'al. disclose the generation of (Fv- 
Cys) 2 heterodimers by chemically crosslinking two V R - 
cys domains together (Cumber et al . , 1992, 
Immunology 149B : 120-126). Although the crosslinked V H 
chains appear to be stable, dissociation of the V L 
polypeptides from each Fv reduces the pharmacological 
value of these constructs in vivo. 

It is an object of the instant invention to 
provide biosynthetic constructs having enhanced, 
pharmacokinetic properties as in vivo targeting agents. 
In particular, it is an object of this invention to 
provide biocompatible constructs having accelerated in 
vivo biodistribution and body clearance rates than that 
of antibodies or antibody fragments. It is another 
object of the invention to provide biosynthetic 
constructs having enhanced avidity in vivo , including 
enhanced target tissue specificity and target tissue 
retention. Yet another object is to provide dimeric 
biosynthetic constructs having improved tissue imaging 



and drug targeting properties in vivo . Still another 
object is to provide diagnostic and therapeutic 
formulations comprising these constructs , having 
particular utility in the diagnosis and treatment of 
malignancies. Still another object is to provide 
constructs having enhanced pharmacokinetic properties 
as in vivo targeting agents, particularly as in vivo 
imaging agents , for ovarian and breast tumor tissue. 

These and other objects and features of the 
invention will be apparent from the description, 
figures and claims which follow. 
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Summary of the Invention 

In its broadest aspect, the invention features a 
formulation for targeting an epitope on an antigen 
expressed in a mammal, where the formulation contains a 
pharmaceutical^ acceptable carrier in combination with 
a biosynthetic construct for binding at least one 
preselected antigen. The diraeric construct has 
particular utility in diagnostic and therapeutic 
applications in vivo. 

The invention features the synthesis and use of 
monomers and dimers of polypeptide constructs belonging 
to the class of proteins known as single-chain Fv (sFv) 
polypeptides. The sFv proteins described herein have 
superior in vivo pharmacokinetic properties, including 
accelerated tissue biodistribution and clearance rates 
relative to either intact IgG, (Fab) 2 dimers or Fab. 

The dimeric biosynthetic construct of the 
invention contains two sFv polypeptide chains defined 
herein as follows. Each sFv polypeptide chain 
comprises an amino acid sequence defining at least two 
polypeptide domains. These domains are connected by a 
polypeptide linker spanning the distance between the C- 
terminus of one domain and the N- terminus of the other. 
The amino acid sequence of each domain includes 
complementarity determining regions (CDRs) interposed 
between framework regions (FRs) where the CDRs and FRs 
of each polypeptide chain together define a binding 
site immunologically reactive with a preselected 
antigen. Additionally, each biosynthetic binding site 
polypeptide chain can have an amino acid sequence 
peptide bonded and thus contiguous with the C-terminus 
of each polypeptide chain, referred to herein as a 
"C-terminal tail" sequence. The term "sFV" refers 



hereinafter, to an sFv molecule containing such a 
C-terminal tail sequence. This tail sequence 
preferably does not contain an a-helical motif that 
self-associates with another polypeptide chain of 
similar sequence but still contains a means for 
covalently crosslinking two such polypeptide chains 
together. When the two sFV polypeptide chains are 
crosslinked together, the resulting dimeric construct 
has a conformation that permits the independent binding 
of a preselected antigen or antigens to the binding 
site of each polypeptide chain in vitro and in vivo. 
The resulting dimeric constructs have superior in vivo 
pharmacokinetic properties that include significantly 
enhanced avidity, including enhanced target tissue 
retention and/or antigen localization properties, as 
compared with intact IgG, Fab, (Fab) 2 dimers or 
monomer ic sFv. 

As will be appreciated by those having ordinary 
skill in the art, the sequence referred to herein 
generally as a "C-terminal tail" sequence, peptide 
bonded to the C-terminus of an sFv and comprising means 
for crosslinking two sFv polypeptide chains, 
alternatively may occur at the N-terminus of an sFv 
("N-terminal tail") or may comprise part of the, 
polypeptide linker spanning the domains of an 
individual sFv. The dimeric species created by the 
crosslinking of sFvs having these alternative "tail" 
sequences also are contemplated to have a conformation 
permitting the in vivo binding of a preselected antigen 
by the binding sites of each of the sFv polypeptide 
chains. Accordingly, descriptions of how to make and 
use sFv' monomers and dimers comprising a C-terminal 
tail sequence are extended hereby to include sFv 
monomers and dimers wherein the tail sequence having 



crosslinking means occurs at the N-terminus of an sFv 
or comprises part of the polypeptide linker sequence. 

In one embodiment, both polypeptide chains bind 
the same epitope on a preselected antigen, and the 
resulting dimeric construct is termed a "homodimer . " 
In another embodiment, the polypeptide chains bind 
different epitopes on a preselected antigen and the 
resulting dimeric construct is termed a "heterodimer . " 
In still another embodiment, the two polypeptide chains 
bind different epitopes on two different, preselected 
antigens . 

The term "epitope", as used herein, refers to a 
portion of an antigen that makes contact with a 
particular antibody or antibody analogue. In a typical 
protein, it is likely that any residue accessible from 
the surface can form part of one or more antigenic 
determinants. The term "antigen", as used herein, 
refers to a molecule that can elicit an immune response 
and that can react specifically with corresponding 
antibodies or antibody analogues. 

The terra "domain", as used herein, refers to an 
amino acid sequence that folds into a single globular 
region in its native conformation, and which may 
exhibit discrete binding or functional properties. The 
term "polypeptide linker", as used herein, refers to an 
amino acid sequence that links the C-terminus of one 
domain to the N-terminus of the other domain, while 
still permitting the two domains to maintain their 
proper physiologically active binding conformations. 
In a particular aspect of the invention, the currently 
preferred polypeptide linkers that connect the 
C-terminus of one domain to the N-terminus of the other 
domain include part or all of amino acid sequence 
((Gly) 4 Ser) 3 set forth in the SEQ. ID. NO. : 7, or 
((Ser) 4 Gly) 3 as' set forth in SEQ. ID. NO.: 8. 



The amino acid sequence of each of the 
polypeptide domains includes complementarity 
determining regions interposed between framework 

regions. The term "complementarity determining 

regions" or "CDRs", as used herein, refer to amino acid 
sequences which together define the binding affinity 
and specificity of the natural Fv region of a native 
immunoglobulin binding site, or a synthetic polypeptide 
which mimics this function. CDRs are not necessarily 
wholly homologous to hypervariable regions of natural 
Fv molecules, and also may include specific amino acids 
or amino acid sequences which flank the hypervariable 
region and have heretofore been considered framework 
not directly determinative of complementarity. The 
term "framework regions" or "FRs " , as used herein, 
refers to amino acid sequences which are found 
naturally occurring between CDRs in immunoglobulins. 
These FR sequences may be derived in whole or part from 
the same immunoglobulin as the CDRs, or in whole or 
part from a different immunoglobulin. For example, in 
order to enhance biocompatibility of an sFv to be 
administered to a human, the FR sequences can be 
derived from a human immunoglobulin and so the 
resulting humanized sFv will be less immunogenic than a 
murine monoclonal antibody. 

The amino acid sequence of each variable domain 
includes three CDRs interspersed between four FRs. The 
two polypeptide domains that define an sFv molecule 
contain CDRs interspersed between FRs which together 
form a binding site immunologically reactive with a 
preselected antigen. The term "immunologically 
reactive", as used herein, refers to the noncovalent 
interactions of the type that occur between an 
immunoglobulin molecule and an antigen for which the 
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immunoglobulin is specific. As used herein, the term 
"avidity" describes the stability of a complex formed 
by a multivalent antibody or antibody analogue, with 
its binding conjugate. Also as used herein, the term 
"apparent avidity" describes the stability of a complex 
formed by an antibody or an antibody analogue with its 
binding conjugate as determined by in vivo 
immunolocalization studies. 

In a preferred aspect of the invention, the CDRs 
of the polypeptide chain can have an amino acid 
sequence substantially homologous with at least a 
portion of the amino acid sequence of CDRs from a 
variable region of an immunoglobulin molecule from a 
first species, together with FRs that are substantially 
homologous with at least a portion of the amino acid 
sequence of* FRs from a variable region of an 
immunoglobulin molecule from a second species. 
Preferably, the first species is mouse and the second 
species is human. The CDR sequences in the sFv' 
polypeptides are preferably substantially homologous to 
an immunoglobulin CDR retaining at least 70%, or more 
preferably 80% or 90%, of the amino acid sequence of 
the immunoglobulin CDR, and also retains the 
immunological binding properties of the immunoglobulin. 

Each sFv' molecule has a C-terminal polypeptide 
tail that has a non-self-associating structure and 
contains at least one crosslinking means. Useful 
crosslinking means include derivatizable amino acid 
side chains, particularly those selected from the group 
consisting of cysteine, lysine, arginine, histidine, 
glutamate, aspartate, and derivatives and modified 
forms thereof. In a preferred aspect of the invention, 
cysteine amino acids are incorporated into the 
C-terminal tail sequences as the crosslinking means. 
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In another aspect of the invention, the crosslinking 
means includes one or more amino acids that can be 
posttranslationally modified. For example, the 
crosslinking means can include one or more 
glycosylation sites, wherein the incorporated 
carbohydrate moieties can be crosslinked in vitro. 
Preferred glycosylation sequences include Asn-Xaa-Thr 
and Asn-Xaa-Ser, where Xaa can be any amino acid, 
wherein the carbohydrate is typically N-linked to 
asparagine or O-linked to serine or threonine. 

Additionally, the tail also may comprise an amino 
acid sequence that defines a metal ion chelation motif, 
and which facilitates purification of the sFV monomers 
by metal ion affinity chromatography, such as the 
IMAC 2+ chromatography system. Furthermore, chelation 
motifs can be used for binding detectable moieties, 
such as Technetium- 990 ( 99m Tc) for in vivo imaging. 
Preferred examples of useful C-terminal tail amino acid 
sequences wherein the crosslinking means is provided by 
the sulfhydryl group of a cysteine, include: Ser-Cys; 
(Gly) 4 -Cys; and (His) 6 -(Gly) 4 -Cys; set forth in the 
Sequence Listing as SEQ. ID. NOS.: 9, 10 and 11, 
respectively. The (Gly) 4 -Cys sequence facilitates the 
coordination of 99m Tc by this tail. 

In the present invention, monomeric sFv' 
molecules can be coupled together through the 
crosslinking means in the C-terminal tails to form 
either homo- or heterodimeric (sFv') 2 species. The 
term "sFv coupler", as used herein, refers to the 
chemical bridge that links two sFV polypeptide chains 
together to form a dimeric species. In a preferred 
aspect of the invention, where the crosslinking means 
is a cysteine residue, the linkage is by a disulfide 
bond. Alternatively, sulf hydryl-specif ic 
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homobifunctional crosslinking reagents, such as 
bismaleimidohexane, or heterobif unctional crosslinking 
reagents, can be used to join the two sFV molecules 
together. sFv couplers of preselected length also can 
be designed to limit interaction between the two sFv' 
polypeptide chains or to optimize binding of two 
preselected antigens, including, for example, multiple 
copies of a receptor expressed on a cell surface in a 
mammal- An example of such a variable length coupler 
includes the bismaleimidocaproyl amino acid (MCA) 
synthetic peptide bridge. Although, in a preferred 
aspect of the invention a GlySer 3 Gly 2 Ser 3 Lys peptide 
spacer is used, in theory, any amino acid sequence can 
be introduced into this type of chemical bridge with a 
variety of reactive moieties at either end. 
Consequently; it is possible to design specific linkage 
groups that can have a predetermined length and 
flexibility. If a substantially inflexible coupler is 
desired, then for instance, a poly lysine or polyproline 
peptide may be used. Another benefit of the MCA 
linkers over many other commercially available linkers 
is that they are soluble in water. Moreover, the 
chemical bridge also may be created to enhance the 
imaging or therapeutic properties of the construct in 
vivo (vide infra). As will be appreciated by those 
having ordinary skill in the art, the separation 
distance between, and interaction of, the sFv' monomers 
in a dimeric construct of the invention also can be 
modulated by the judicious choice of amino acids in the 
tail sequences themselves. 

The dimeric constructs of this invention 
preferably target a pharmacologically active drug (or 
other ancillary protein) to a site of interest 
utilizing the bivalent capability of the dimer. 



Examples of pharmacologically active drugs include 
molecules that inhibit cell proliferation and cytotoxic 
agents that kill cells. The term "cytotoxic agent", as 
used herein* refers to any molecule that kills cells, 
and includes anti-cancer therapeutic agents such as 
doxorubicin. Other, useful molecules include toxins, 
for instance, the toxic portion of the Pseudomonas 
exotoxin, phytolaccin, ricin, ricin A chain, or 
diptheria toxin, or other related proteins known as 
ricin A chain-like ribosomal inhibiting proteins, i.e., 
proteins capable of inhibiting protein synthesis at the 
level of the ribosome, such as pokeweed antiviral 
protein, gelonin, and barley ribosomal protein 
inhibitor. 

In such cases, one sFv' can be immunologically 
reactive with a binding site on an antigen at the site 
of interest, and the second sFV in the dimer can be 
immunologically reactive with a binding site on the 
drug to be targeted. Alternatively, the construct may 
bind one or more antigens at the the site of interest 
and the drug to be targeted is otherwise associated 
with the dimer, for example, crosslinked to the 
chemical bridge itself. The biosynthetic dimeric 
constructs of this invention also may be used as part 
of human therapies to target cytotoxic cells such as 
cytotoxic T-lymphocytes, or pharmacologically active 
drugs to a preselected site. A bispecific (sFv') 2 
heterodimer having specificity for both a tumor antigen 
and a CD3 antigen, the latter of which is present on 
cytotoxic T-lymphocytes, thus could mediate antibody 
dependent cellular cytotoxicity (ADCC ) or cytotoxic T- 
lymphocyte-induced lysis of the tumor cells. 

Still another bispecific dimeric construct having 
cytotoxic properties is a bispecific construct with one 
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sFV capable of targeting a tumor cell and the second 
sFV having catalytic properties that binds an inactive 
drug, subsequently converting it into an active 
compound (see for example, U.S. Patent 5,219,732). 
Such a construct would be capable of inducing the 
formation of a toxic substance in situ . For example, a 
catalytic sFv' molecule having p-lactamase-like 
activity can be designed to bind and catalyze the 
conversion of an inactive lactam derivative of 
doxorubicin into its active form. Here the bispecific 
dimer, having binding affinities for both the 
preselected antigen and the inactive-lactam derivative, 
is administered to an individual and allowed to 
accumulate at the desired location. The inactive and 
nontoxic cytotoxin-lactam derivative then is 
administered to the individual. Interaction of the 
derivative with the bispecific (sFv') 2 heterodimer at 
the site of interest releases the active form of the 
drug in situ , enhancing both the cytotoxicity and 
specificity of the drug. 

The homo- and heterodimeric biosynthetic 
constructs also may include a detectable moiety bound 
either to the polypeptide chain, e.g., to the tail 
sequence, or to the chemical coupler. The term, 
"detectable moiety", *as used herein, refers to the 
moiety bound to or otherwise complexed with the 
construct and which can be detected external to, and at 
a distance from, the site of the complex formation, to 
permit the imaging of cells or cell debris expressing a 
preselected antigen. Preferable detectable moieties 
for imaging include radioactive atoms such as 
Technetium" 9 9 m ( 99bi Tc), a gamma emitter with a half- 
life of about 6 hours. Non-radioactive moieties useful 
for in vivo magnetic resonance imaging applications 
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include nitroxide spin labels as well as lanthanide and 
transition metal ions which induce proton relaxation in 
situ . In addition to inununo imaging , the complexed 
radioactive moieties also may be used in standard 
radioimmunotherapy protocols to destroy the targeted 
cell. Preferable nucleotides for high dose 
radioimmunotherapy include radioactive atoms such as, 
90 Yttrium ( 90 Yt), 13l Iodine ( 131 I) or "'indium 



The sFv, sFV and (sFv') 2 constructs disclosed - 
herein have particular utility as in vivo targeting 
agents of tumor antigens, including antigens 
characteristic of breast and ovarian malignancies, such 
as the c-erbB-2 or c-erbB-2 related antigens. 
Accordingly, these constructs have particular utility 
in diagnostic applications as imaging agents of 
malignant cells, and in therapeutic applications as 
targeting agents for cytotoxins and other cancer 
therapeutic agents. In one preferred aspect of the 
invention, the CDRs of the sFv or sFV polypeptide 
chain have an amino acid sequence substantially 
homologous with the CDRs of the variable region of any 
one of the following monoclonal antibodies: 741F8, 
520C9, and 4 54C11, all of which bind to c-erbB-2- or c- 
erbB-2-related antigens. Exemplary sFV and sFv 
sequences having CDRs corresponding to the monoclonal 
antibodies 7 41F8 and 520C9 are set forth in the 
Sequence Listing SEQ. ID. NOS.: 1 and 5, respectively. 

The term "c-erbB-2" refers to a protein antigen 
that is an approximately 200 kD acidic glycoprotein 
having an isoelectric point of about 5.3 and having an 
extracellular domain overexpressed on the surface of 
tumor cells, such as breast and ovarian tumor cells in 
about 25% of cases of breast and ovarian cancer. A "c- 
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erbB-2-related tumor antigen" is a protein located on 
the surface of tumor cells, such as breast and ovarian 
tumor cells and which is antigenically related to the 
c-efbB-2 antigen. That is, the related antigen can be 
bound by an immunoglobulin that is capable of binding 
the c-erbB-2 antigen (e.g. 741F8, 520C9, and 454C11 
antibodies. Related antigens also include antigens 
comprising an amino acid sequence that is at least 80% 
homologous, preferably 90% homologous, with the amino 
acid sequence of c-erbB-2 or an amino acid sequence 
encoded by a DNA that hybridizes under stringent 
conditions with a nucleic acid sequence encoding 
c-erbB-2. As used herein, stringent hybridization 
conditions are those set forth in Sambrook, et al . , 
1989, Molecular Cloning ; A Laborat ory Manual 2nd ed. 
Cold Spring Harbor' Press wherein the hybridization 
conditions, for example, include 50% formamide, 5x 
Denhardt's Solution, 5xSSC, 0.1% SDS and lOOpg/ml 
denatured salmon sperm DNA and the washing conditions 
include 2xSSC, 0.1% SDS at 37 °C followed by lxSSC, 0.1% 
SDS at 68°C. An example of a c-erbB-2-related antigen 
is the receptor for the epidermal growth factor. 

'In one embodiment, the biosynthetic antibody 
binding site is a humanized hybrid molecule which 
includes CDRs from the mouse 741F8 antibody interposed 
between FRs derived from one or more human 
Immunoglobulin molecule. The CDRs that bind to the c- 
.erbB-2 epitope can be found in the amino acid residue 
numbers 31-37, 52-68, 101-110, 159-169, 185-191 and 
,224-233 in SEQ ID NOS.: 1 and 2^j The hybrid molecule 
thus contains binding sites whidtt are highly specific 
for the c-erbB-2 antigen or c-erbB-2 related antigens 
held in proper immunochemical binding conformation by 
human FR amino acid sequences, which are less likely to 



be recognized as foreign by the human body. 

The dimeric (sFv') 2 construct can either be 
homodimeric, wherein the CDR sequences on both monomers 
define the same binding site, or heterodimeric, wherein 
the CDR sequences of each sFv' monomer define a 
different binding site. An example of an (sFv') 2 
heterodimer described herein having specificity for 
both c-erbB-2 and digoxin epitopes can be generated by 
combining the anti-c-erbB-2 sFv' , shown in SEQ. ID. 
NOS.: 1 and 2 with the anti-digoxin sFV, shown in SEQ. 
ID. NOS. : 3 and 4. The CDRs that bind to the digoxin 
epitope can be derived from the anti-digoxin murine 
monoclonal antibody 26-10 (Huston et al . , 1988, Proc. 
Natl . Acad . Sci . USA 85: 5879-5883) and can be found in 
the amino acid residue numbers 32 through 36, 48 
through 65, 101 through 107, 157 through 170, 188 
through 194 and 229 through 234 in the Sequence Listing 
as SEQ. ID. NOS.: 3 and 4. 

Radio imaging or radio immunotherapy of tumor 
tissues and malignant cells are preferred aspects of 
this invention. Overexpression of tumor antigens such 
as c-erbB-2 and related cell surface antigens in 
malignant cells allows imaging of the malignant cell or 
tissue, whether it is well localized, has undergone 
metastasis or is exposed following cell lysis. The 
imaging method includes the steps of administering to a 
mammal a formulation comprising an sFv' or (sFv') 2 
dimeric construct having specificity for the antigen 
tumor and containing a detectable moiety at a 
concentration sufficient to permit extracorporeal 
detection of the construct bound to the tumor antigen; 
and then detecting the biosynthetic construct bound to 
the tumor antigen. The formulation can be used to 
particular advantage in gamma scintigraphy or magnetic 
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resonance imaging. Overexpression of c-erbB-2 or 
related receptors on malignant cells thus allows 
targeting of sFV species to the tumor cells, whether 
the tumor is well-localized or metastatic. In 
addition, internalization of an sFv-toxin fusion 
protein permits specific destruction of tumor cells 
bearing the overexpressed c-erbB-2 or related antigen. 

The present invention discloses monomeric and 
dimeric biosynthetic constructs having enhanced 
properties as in vivo targeting agents when compared 
with intact monoclonal antibodies or their Fab 
3 V » fragments^ The dimeric biosynthetic constructs of the 
fj * invention %lso permit the in vivo targeting of an 
j{ epitope on an antigen with greater apparent avidity, 

ij including greater tumor specificity, tumor localization 

J! and tumor retention properties than that of the Fab 

!l fragment having the same CDRs as the construct. 

Furthermore, the dimeric constructs also permit the in 
3 vivo targeting of an epitope on an antigen with a 

?j greater apparent avidity, including greater tumor 

3} localization and tumor retention properties, than 

2 either of the monomeric polypeptides individually. 

=l The invention also includes methods for producing 

the homo- and heterodimeric biosynthetic constructs, 
which include the steps of designing, constructing, 
expressing, purifying, and refolding the monomeric sFV 
polypeptide chains in vitro , followed by joining two 
polypeptide chains together through the crosslinking 
means in the C-terminal tail sequence, without relying 
on the tail structure to otherwise assist in dimer 
formation or enhance transport across a membrane. The 
invention also includes methods for imaging a 
preselected antigen in a mammal expressing the 
preselected antigen. The antigen may be expressed on a 



cell surface or may be released as part of the cell 
debris from a dying cell. 

The foregoing and other objects, features and 
advantages of the present invention will be made more 
apparent from the following detailed description of the 
invention. 



- 20 - 



Brief Description of the Drawings 

The foregoing and other objects of this 
invention, the various features thereof, as well as the 
invention itself, will be more fully understood from 
the following description, when read together with the 
accompanying drawings: 

FIG. 1A is a schematic representation of a DNA 
construct encoding the sFV biosynthetic binding 
protein of the invention; 

FIG. IB is a schematic representation of the 
polypeptide chain encoded by the DNA construct in FIG. 
1A; 

FIG. 2 A is a schematic representation of a 
refolded sFv' protein in its native conformation; 

FIG. 2B is a schematic representation showing two 
folded sFv' polypeptides covalently linked by a 
disulfide bond; 

FIG. 3 is a graphic representation of an in vitro 
competition assay comparing the c-erbB-2 binding 
activity of an Fab fragment of the 520C9 monoclonal 
antibody (filled dots), with that of biosynthetic 520C9 
sFv at two different stages of purification: mixture 
of folded and unfolded sFv (+) or affinity-purified sFv 
(squares), and with a material that did not bind to the 
affinity column (*); 

FIG. 4 lists in tabular form the tumor: organ 
ratios calculated for various sF and sFv' species 
injected into tumor-containing mice; 

FIG. 5 lists in tabular form the percentage of 
injected dose localized to tumor tissue for various sFv 
and sFv's species; and 

FIG. 6 is a graphic representation summarizing 
the comparative tumor retention properties of monomeric 
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and dimeric forms of different sFv' constructs and Fabs 
represented by bars 1-6. The sFV species represented 
by bars 1-5 are based on thr V regions of the 741F8 
monoclonal antibody. Bar 1 refers to intravenously 
(i.v.) administered glutathionyl-( sFv' -SerCys ) monomer, 
bar 2 to disulfide linked ( sFV -Gly 4 -Cys ) 2 , bar 3 to 
MCA combined ( sFv-Ser-Cys ) 2 , bar 4 to BMH cross-linked 
(sFv-Ser-Cys) 2/ bar 5 to 741F8 Fab and bar 6 to the 
26-10 disulfide linked ( sFv-Ser-Cys ) 9 . 




- 22 - 



Detailed Description of the Invention 

It has been discovered that intravenously 
administered single-chain Fv (sFv) proteins exhibit 
superior in vivo pharmacokinetic properties relative to 
intact monoclonal antibodies (IgG), (Fab) 2 dimers or 
Fab fragments. These pharmacokinetic properties 
include accelerated rates of tissue biodistribution, 
enhanced target tissue specificity, and exceptionally 
fast clearance rates. The sFv constructs can be 
designed to bind to preselected antigens and to have 
particular utility for in vivo immunoimaging and 
immunotherapy applications. In addition, it also has 
been discovered that dimeric forms of the constructs, 
which do not rely on self-associating tail sequences 
for dimerization or transport across a membrane, can be 
easily prepared and have improved target tissue 
localization properties, target tissue retention 
properties and/or avidity for their targets in vivo, 
relative to monomer ic sFv' , Fab fragments or intact 
IgG. 

In its broadest aspect, the invention features a 
formulation for targeting an epitope on an antigen 
expressed in a mammal. The formulation contains a 
pharmaceutical^ acceptable carrier in combination with 
a dimeric biosynthetic construct for binding at least 
one preselected antigen. The preselected antigen 
either may be an antigen expressed on the surface of a 
cell or an intracellular component exposed upon lysis 
of the cell. The sFv, sFV and (sFv') 2 constructs 
disclosed herein have particular utility as in vivo 
targeting agents for detecting malignant cells in a 
mammal. In a particularly useful embodiment, the 
constructs disclosed can be used to target the c-erbB-2 
or c-erbB-2-related antigens which are overexpressed in 
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certain breast and ovarian cancers. In another 
embodiment, radioimmunotargeting using radiolabeled 
(sFv') 2 constructs will be useful for therapeutic as 
well as diagnostic applications. 

Provided below are detailed descriptions of 
biosynthetic sFv, sFv' and (sFv') 2 dimers, useful in 
the compositions and methods of the invention, together 
with methods for their construction and administration. 
Also provided are numerous, non-limiting examples which 
demonstrate the suitability of these constructs as in 
vivo targeting reagents for diagnostic and therapeutic 
applications. More specifically, the examples 
demonstrate: the construction and expression of sFv 
polypeptides (Example 1); the renaturation, 
dimerization and purification of sFv' proteins (Example. 
2 ) ; and the immunoreactivity of the monomeric and 
dimeric sFv proteins (Example 3). 

Construction of Biosynthetic Single-chain Fv Proteins . 

Each of the sFv and sFv' proteins have amino acid 
sequences that define at least two polypeptide domains. 
The polypeptide domains are connected by a polypeptide 
linker spanning the distance between the C- terminus of 
one domain and the N-terminus of the otlier. The amino 
acid sequence of each domain includes complementarity 
determining regions (CDRs) interposed between framework 
regions (FRs), where the CDRs and FRs of each 
polypeptide chain together define a binding site 
immunologically reactive with a preselected antigen. 

In the case of the sFv' proteins, each 
polypeptide chain has an additional C-terminal tail 
amino acid sequence having a substantially non-self- 
associating structure. More specifically, this is a 
sequence that does not interact appreciably with a 
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similar sequence under physiological conditions , as is 
the case for example with the a-helical leucine zipper 
motifs found in DNA binding proteins. Each tail 
sequence also contains a means for crosslinking two 
such sFv' polypeptide chains together to form an 
(sFv') 2 dimer. The resulting (sFv') 2 dimers have 
conformations which permit the in vivo binding of the 
preselected antigen by the binding sites of each of the 
polypeptide chains. 

The sFv' constructs of this invention can be 
further understood by referring to the accompanying 
Figures 1 and 2. FIG. 1A is a schematic representation 
of the DNA construct, and FIG. IB is a schematic 
representation of the resulting encoded polypeptide 
chain. FIG. 2 is a schematic representation of the 
folded sFV monomer (FIG. 2A) and the dimeric (sFv') 2 
construct (FIG. 2B) . A single-chain Fv (sFv') 
polypeptide , shown in FIGS. 1 and 2A, comprises: a 
heavy chain variable region (V R ) 10, and a light chain 
variable region, (V L ) 14, wherein the V H and V L 
domains are attached by polypeptide linker 12. The 
binding domains defined by V L and V R include the CDRs 
2, 4, 6 and 2', 4', 6', respectively, and FRs 32, 34, 
36, 38 and 32', 34', 36' 38', respectively which, as 
shown in FIG. 2, together define an immunologically 
reactive binding site or antigenic determinant, 8. 
Furthermore, the CDRs and FRs may be derived from 
different immunoglobulins (see infra). The-sFv' 
molecules also contain a C-terminal tail amino acid 
sequence , 16, comprising an amino acid sequence that 
will not self-associate with a polypeptide chain having 
a similar amino acid sequence under physiological 
conditions, and which contains a means, 18, for the 
site-directed crosslinking of two such tail sequences. 



In a currently preferred embodiment , represented in 
FIGS. 1 and 2, the crosslinking means is the sulfhydryl 
group of a cysteine amino acid. In the monomeric form 
of the sFv' the crosslinking means, 18, may be blocked 
by a blocking group, 20. For instance, the blocking 
group may be a glutathionyl moiety when the 
crosslinking means, 18, is a cysteine amino acid. 

As will be appreciated by those having ordinary 
skill in the art, the sequence referred to herein 
generally as a n C-terminal tail" sequence, peptide 
bonded to the C-terminus of an sFv and comprising means 
for crosslinking two sFv polypeptide chains, 
alternatively may occur at the N-terminus of an sFv 
( M N-terminal tail") or may comprise part of the 
polypeptide linker spanning the domains of an 
individual sFv. The dimeric species created by the 
crosslinking of sFvs having these alternative "tail" 
sequences also are contemplated to have a conformation 
permitting the in vivo binding of a preselected antigen 
by the binding sites of each of the sFv polypeptide 
chains. Accordingly, descriptions of how to make and 
use sFv' monomers and dimers comprising a C-terminal 
tail sequence are extended hereby to include sFv 
monomers and dimers wherein the tail sequence having 
crosslinking means occurs at the N-terminus of an sFv 
or comprises part of the polypeptide linker sequence. 

The CDR and FR polypeptide segments are designed 
empirically based on sequence analysis of Fv regions of 
preexisting antibodies, such as those described in U.S. 
Patent No. 4,753,894, hereby incorporated by reference. 
Numerous examples of sFv polypeptide chains now exist 
in the art and are summarized in Huston et al . , 1993, 
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Intern . Rev . Immunol . 10: 195-217, hereby incorporated 
by reference. 

The sFv and sFv' polypeptide chains of the 
invention are biosynthetic in the sense that they are 
synthesized, transfected into a cellular host, and 
protein expressed from a nucleic acid containing 
genetic sequences based in part on synthetic DNA. 
Synthetic DNA is understood to include recombinant DNA 
made by ligation of fragments of DNA derived from the 
genome of a hybridoma, mature B cell clones, a cDNA 
library derived from natural sources, or by ligation of 
plural, chemically synthesized oligonucleotides* The 
proteins of the invention are properly characterized as 
"antibody binding sites", in that these synthetic 
single polypeptide chains are able to refold into a 
3-dimensional conformation with specificity and 
affinity for a preselected epitope on an antigen. 

A detailed description for engineering and 
producing sFv proteins by recombinant means appears in 
US Patent 5,091,513 claiming priority from U.S. S.N. 
052,800, filed May 21, 1987, assigned to Creative 
BioMolecules, Inc., hereby incorporated by reference. 
The polypeptide chains of the invention are antibody- 
like in that their structure is patterned after ^regions 
of native antibodies known to be responsible for 
antigen recognition. 
"7 The single-chain polypeptide chains of the 
invention are first derived at the DNA level. The sFv 
DNAs are preferably expressed in coli, the resulting 
polypeptide chains being solubilized from inclusion 
bodies, refolded in vitro , labeled with a detectable 
moiety, such as 99ra Tc, and dimerized to form a 
biosynthetic (sFv') 2 construct. Of course, the 
constructs disclosed herein may also be engineered for 



secretion from the host cell, for example, secretion 
into the periplasmic space of an coli cell, as 
described by Pack and Pluckthun, ( Biochem . , 1992, 3J.: 
1579-1584 ), .or into the culture supernatant of a 
mammalian cell (for example, as described by 
Traunecker, et al . , 1991, EMBO J. 10: 3655-3659). 

The ability to design the single polypeptide 
chains of the invention depends on the ability to 
identify Fv binding domains of interest, and to obtain 
the DNA encoding these variable regions. Hybridoma 
technology enables the production of cell lines that 
secrete antibodies to essentially any desired substance 
that elicits an immune response. For example, 
U.S. Patent 4,753,894 describes some monoclonal 
antibodies of interest which recognize c-erbB-2 related 
antigens on breast cancer cells, and explains how such 
antibodies were obtained. One monoclonal antibody that 
is particularly useful in targeting the c-erbB-2 
antigen is 741F8 (Bjorn et al . , 1985, Cancer Res . 45 : 
1214-1221; U.S. Patent 4,753,894). This antibody 
specifically recognizes the c-erbB-2 antigen expressed 
on the surface of various tumor cell lines, and 
exhibits very little binding to normal tissues. Other 
monoclonal antibodies that bind c-erbB-2 or related 
antigens include 520C9 and 454C11 (Frankel et al. , 
1985, Biol. Resp. Modif . 4: 273-286; Ring et al. , 
1989, Cancer Res . 49: 3070-3080, Ring et al . , 1991, 
Molec. Immunol . 28: 915-917; U.S. Patents 4,753,894 and 
5,169,774). sFv' sequences with the desired 
specificity can also be derived from phage antibody 
cloning of combinatorial V gene libraries. Such 
sequences could be based on cDNA derived from mice 
preimmunized with tumor cell membranes bearing c-erbB-2 
or related antigenic fragments, (See, for example, 
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Clackson et al, (1991) Nature 352 : 624-628). 

The process of designing DNA encoding the single 
polypeptide chain of interest can be accomplished as 
follows. Either synthetic DNA duplexes can be ligated 
together to form a synthetic gene or relevant DNA 
fragments can be cloned from libraries. In the latter 
procedure, mRNA encoding the light and heavy chains of 
the desired immunoglobulin may be isolated from 
hybridomas producing the immunoglobulin and reverse 
transcribed into cDNA. The V H and V L genes 
subsequently can be isolated by standard procedures , 
for instance , by colony hybridization of cDNA libraries 
(see for example, Sambrook et al . , eds., 1989, 
Molecular Cloning , Cold Spring Harbor Laboratories 
Press, NY) or by polymerase chain reaction (PCR) (see 
for example, Innis et aL , eds., 1990, PCR Protocols , A 
guide to methods and applications , Academic Press). 
Both procedures are well known in the art. 

Still another approach involves the design and 
construction of synthetic variable domain genes 
encoding a predetermined, specific Fv binding site. 
For example, with the help of a computer program, such 
as Compugene, one may design and directly synthesize 
native or near-native FR sequences from a first, 
antibody molecule, and CDR sequences from a second 
antibody molecule. The resulting V R and V L gene 
sequences can then be genetically linked together by 
means of a linker connecting the C-terminus of one 
chain with the N-terminus of the other. 

Practice of the invention enables the design and 
synthesis of various single-chain binding proteins, all 
of which are characterized by a region having affinity 
for a preselected epitope on an antigen. Other regions 
of the biosynthetic protein are designed with the 
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particular planned utility of the protein in mind. 
Thus, if the reagent is designed for intravascular use 
in mammals, the FRs may include amino acid sequences 
which are similar or identical to at least a portion of 
the FR amino acid sequences of antibodies native to 
that species. The amino acid sequences constituting 
the CDRs may be analogous to the sequences from a 
second, different preexisting antibody having 
specificity for the antigen of interest (e.g. a murine 
or other human IgG) . Alternatively, the CDRs and FRs 
may be copied in their entirety from a single pre- 
existing monoclonal antibody cell line or a desirable 
sFv species may be cloned from a repertoire library 
derived from preimmunized or naive animals. 

It is noted however, that the linear arrangement 
of the V L and V fl domains in the DNA sequence of Fig. 1 
is not critical. That is, although the sequence 
represented in Fig. 1A encodes a heavy chain variable 
region followed by the light chain variable region, as 
will be appreciated by those skilled in the art, the 
sFv may be constructed so that the light and heavy 
chain domains are in reverse order. 

As mentioned above, the V H and V L domains of the 
sFv are linked in the gene construct by means of. a 
linker 12 (FIG. 1A) . The linker should be at least 
long enough (e.g., about 10 to 15 amino acids or at 
least 40 Angstroms in length) to permit domains 10 and 
14 to assume their proper conformations and interdomain 
relationships. The linkers preferably comprise 
hydrophilic amino acids that assume an unstructured 
configuration under physiological conditions, and are 
free of residues having large side groups that could 
interfere with proper folding of the V H , V L , or pendant 
chains. Examples of currently preferred linkers 
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include either part or all of the amino acid sequences 
((Gly) 4 Ser) 3 and ( (Ser) 4 Gly) 3 , set forth in the 
Sequence Listing as SEQ. ID. NOS.: 1 and 8, 
respectively. The linker may also include an amino 
acid sequence homologous to a sequence identified as 
"self" by the species into which it will be introduced, 
particularly if a therapeutic application is intended. 

Considerations for Suitable C-terminal Tail Sequences. 

As mentioned above, the sFV polypeptide chains 
further comprise a C-terminal tail containing at least 
one amino acid that can be derivatized or post- 
translationally modified to enable crosslinking of two 
such sFV monomers. In preferred aspects of the 
invention, the tail sequences include one or more of 
the sequences Ser-Cys, (Gly) 4 "Cys and 

(His) 6 -(Gly) 4 -Cys, set forth in the Sequence Listing as 
SEQ. ID. NOS.: 9, 10, and 11, respectively. The C- 
terminal tails preferably do not form cc-helical 
structures which self-associate under physiological 
conditions, such as the a-helical leucine zipper motifs 
found in DNA binding proteins (O'Shea et al . , 1989, 
Science 243 : 538-542, O'Shea et al . , 1991, Science 254: 
539-544) or the four-helix bundle motifs found in 
recombinant ion channels (Hill et al . , 1990, Science 
294 : 543-546) . 

Suitable derivatizable amino acid side chains may 
be selected from the group consisting of cysteine, 
lysine, arginine, histidine, glutamate, aspartate and 
derivatives or modified forms thereof. In a preferred 
aspect of the invention, cysteine amino acids are 
incorporated into the C-terminal tail sequences as the 
crosslinking means. * 



Also envisioned to be useful are 
posttranslationally modified amino acids that can be 
crosslinked in vitro . More specifically, the glycosyl 
moieties present on glycosylated amino acids, following 
secretion out of the cell, can be covalently attached 
in vitro using bifuntional linkers on standard sugar 
chemistry (see for example, E . A. Davidson (1967) 
Carbohydrate Chemistry , Holt, Kinehart and Winston, 
NY.; W.J. Lennarz (1980) The Biochemsitry of 
Glycoproteins and Proteoglycans , Plenum Press, N.Y.). 
Particularly useful glycosylation sites include the 
sequences Asn-Xaa-Thr and Asn-Xaa-Ser , wherein Xaa is 
any amino acid. Where crosslinking of glycosyl 
moieties is contemplated, the glycosylation sequences 
need not include a cysteine. 

The tail also may comprise an amino acid sequence 
defining an ion chelation motif which can be used as 
part of a purification protocol for isolating of the 
sFv' monomers by metal ion affinity chromatography 
(e.g., by means of a (His) 6 tail on an IMAC 
chromatography column), as well as for chelating ions 

-99m 111 

of detectable moieties such as Technetium or 
Indium for in vivo imaging applications. 

sFv' Coupler Considerations. 

In the present invention, two monomer ic sFv' 
proteins are crosslinked together through their C- 
terminal tails to form an ( sFV ) 2 dimer . The term "sFv 
coupler", as used herein, refers to chemical bridges 
that join the crosslinking residues in each of the sFv' 
molecules. 

In one preferred aspect of the invention, where 
the crosslinking residue is a cysteine residue, the 
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chemical bridge can be a disulfide bond- 
Alternatively, sulfhydryl-specif ic crosslinking 
reagents can be used to join two sFv' molecules 
together. An example of such a cysteine-specif ic 
chemical bridge includes the bifunctional crosslinking 
reagent bismaleimidohexane (BMH), a water insoluble 
linker that can be obtained from Pierce, Rockford, IL. 
Other bifunctional crosslinking agents include 
heterobifunctional crosslinkers which can be used to 
join two sFv' species together where the crosslinking 
residues in each of the sFv' C-terminal tail sequences 
are different, such as, a C-terminal cysteine on one 
sFv' and a C-terminal lysine on the other. Useful 
heterobifunctional crosslinking agents include 4- 
succinimidyloxycarbonyl-methyl- ( 2-pyridyldithio ) - 
toluene (SMPT) or N-succinimidyl-3-( 2- 
pyridyldithio) propionate (SPDP), both of which can be 
obtained from Pierce, Rockland, IL. 

sFv couplers of variable length also can be 
prepared to limit steric interaction of two coupled 
sFv' proteins. An example of such an sFv coupler 
includes a peptide bridge, such as the water soluble 
bismaleimidocaproyl amino acid (MCA) linker. Although 
in a preferred aspect of the invention, an MCA- ^ 
GlySer 3 Gly 2 Ser 3 Lys-MCA linker is used, in theory, any 
amino acid sequence can be introduced into this type of 
chemical bridge-spacer group. 

Suitable MCA-peptide chemical bridges can be 
synthesized on polystyrene resins f unctionalized with 
hydroxymethylphenoxyacetic acid (HMP) to allow 
formation of free acids at the C-terminus following 
deblocking. During the synthesis of the preferred 
peptide sequence Gly-Ser 3 -Gly 2 "Ser 3 -Lys the C-terminal 
lysine is esterified to the resin and other amino acids 
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are added as N-a-Fmoc protected derivatives. DIC/ 
hydroxybenzotriazol activated amino acids are coupled 
for 90 minutes after which the N-a-Fmoc protected 
groups are deprotected with .20% piperidine in 
dimethyl formamide (DMF) . Upon completion of the 
synthesis, the peptide is cleaved from the resin and 
deprotected with 95% trif luoroacetic acid (TFA) in 
water. The crude peptide then is dissolved in 0 . 1M 
phosphate buffer pH 7 and reacted overnight at room 
temperature with maleimidocaproic acid N-hydroxy- 
succinimide ester. The resulting homobif unctional 
peptide crosslinker can be purified by reverse-phase 
HPLC, for example, on a Vydac 1 x 25 cm column using 
acetonitrile/water/TFA buffers. 

With this procedure, it is possible to generate 
linkers having specific lengths and flexibilities. 
Since polypeptides having particular secondary 
structures and flexibilities are well documented in the 
art, it is possible to judiciously design the peptide 
couplers with optimal length and flexibility to 
optimize binding to two preselected antigens on a cell 
surface. As will be appreciated by those skilled in 
the art, the separation distance between, and 
interaction of, the sFv' monomers in a dimeric , 
construct of* the invention also can be modulated by the 
judicious choice of amino acids in the tail sequences 
themselves. 

Dimer Considerations. 

Using the approaches described above, (sFv') 2 
dimers readily can be prepared wherein the resulting 
dimers either can be homodimeric, where the CDR 
sequences define the same epitope binding site, or 
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heterodimeric, where the CDR sequences of each sFV 
monomer define different epitope binding sites. 

The dimeric constructs of this invention 
preferably target a pharmacologically active drug (or 
other ancillary protein) to a site of interest 
utilizing the bivalent capability of the dimer. 
Examples of pharmacologically active drugs include 
molecules that inhibit cell proliferation and cytotoxic 
agents that kill cells. Other, useful molecules 
include toxins, for instance, the toxic portion of the 
Pseudomonas exotoxin, phytolaccin, ricin, ricin A 
chain, or diptheria toxin, or other related proteins 
known as ricin A chain-like ribosomal inhibiting 
proteins, i.e., proteins capable of inhibiting protein 
synthesis at the level of the ribosome, such as 
pokeweed antiviral protein, gelonin, and barley 
ribosomal protein inhibitor. 

In such cases, one sFV can be immunologically 
reactive with a binding site on an antigen at the site 
of interest, and the second sFV in the dimer can be 
immunologically reactive with a binding site on the 
drug to be targeted. For example, the (sFv') 2 dimers 
may have specificity for both c-erbB-2 and a 
pharmacologically active drug or cytotoxic agent. The 
resulting dimer can thus target the agent or drug to 
tissues expressing the c-erbB-2 antigen in vivo- 
Alternatively, the construct may bind one or more 
antigens at the the site of interest and the drug to be 
targeted is otherwise associated with the dimer, for 
example, by crosslinking to the chemical bridge itself. 

Other bispecific (sFv') 2 constructs having 
particular utility in targeting malignant cells, 
include constructs wherein one has specificity for a c- 



erbB-2 or related tumor antigen , and the second 
determinant has specificity for a different cell 
surface protein, such as the CD3 antigen found on 
cytotoxic T-lymphocytes. The heterodimeric (sFv') 2 
construct then could mediate antibody dependent 
cellular cytotoxicity (ADCC) or cytotoxic T-lymphocyte- 
induced lysis of the tumor cells expressing the c-erbB- 
2 antigen. 

Still another bispecific dimeric construct having 
cytotoxic properties is a bispecific construct with one 
sFV capable of targeting a tumor. cell and the second 
being a catalytic sFv' that binds an inactive drug, and 
subsequently converts it into an active compound (see 
for example, U.S. Patent 5,219,732). Such a construct 
would be capable of inducing the formation of a toxic 
substance in situ . For example, a catalytic sFv' 
molecule having 0-lactamase-like activity can be 
designed to bind and catalyze the conversion of an 
inactive lactam derivative of doxorubicin" into the 
active, cytotoxic form. Here the bispecific dimer, 
having binding affinities for both the preselected 
antigen and the cytotoxic-lactam derivative, is 
administered to an individual and allowed to accumulate 
at the desired location. The inactive, nontoxic 
cytotoxin-lactam derivative then is administered to the 
individual. When the derivative is complexed with the 
bispecific (sFv') 2 heterodimer in situ the active form 
of the drug is released, enhancing both the 
cytotoxicity and specificity of the drug. 

Hybrid sFv' Considerations . 



In a preferred aspect of the invention a 
humanized single-chain Fv is envisioned whereby the 



recombinant sFV contains CDRs of the murine 741F8 
antibody interposed between human FR sequences to 
generate a humanized c-erbB-2 binding protein. The 
humanized Fv would be capable of binding c-erbB-2 while 
eliciting little or no immune response when 
administered to a patient. A nucleic acid sequence 
encoding a humanized sFv may be designed and 
constructed as follows. 

FR regions identified by homology searches of the 
GenBank database can be introduced into an sFv of 
interest by site-directed mutagenesis to reproduce the 
corresponding human sequence. Alternatively , 
homologous human V H and V L sequences can be derived 
from a collection of PCR-cloned human V regions, after 
which the human FR sequences can be ligated with murine 
CDR regions to create humanized V L and V H genes. A 
humanized sFv hybrid thus can be created, for instance,, 
where the human FR regions of the human myeloma 
antibody are introduced between the murine CDR 
sequences of the murine monoclonal antibody 741F8. The 
resulting sFv, containing the sequences FR1-CDR1-FR2- 
CDR2-FR3-CDR3-FR4, contains a murine binding site in a 
human framework. 

By directly sequencing the DNA or RNA in a 
hybridoma secreting an antibody to a preselected 
antigen, or by obtaining the sequence from the 
literature, one skilled in the art can essentially 
produce any desired CDR and FR sequence. Expressed 
sequences subsequently may be tested for binding and 
empirically refined by exchanging selected amino acids 
in relatively conserved regions, based on observations 
of trends of amino acid sequences in data bases and/or 
by using computer-assisted modeling techniques. 
Significant flexibility in V R and V L design is possible 
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because alterations in amino acid sequences may be made 
at the DNA level. 

Of course, the processes for manipulating, 
amplifying, and recombining DNAs that encode amino, acid 
sequences of interest are generally well known in the 
art (see, for example, Sambrook et al., 1989, Molecular 
Cloning A Laboratory Manual , 2nd ed. Cold Spring Harbor 
Press), and therefore, are not described in detail 
herein. Similarly, methods for identifying the 
isolated V genes encoding antibody Fv regions of 
interest are well understood and are described in the 
patent and other literature. 

Expression of Recombinant sFv Pr oteins. 

The resulting sFv DNA constructs then are 
integrated into expression vectors and trans fected into 
appropriate host cells for protein expression. After 
being translated, the protein may be purified from the 
cells themselves or recovered from the culture medium. 

The expression vectors also may include various 
sequences to promote correct expression of the 
recombinant protein. Typical sequences include 
transcription promoters and termination sequences, 
enhancer sequences, preferred ribosome binding site 
sequences, preferred raRNA leader sequences, preferred 
protein processing sequences, preferred signal 
sequences for protein secretion, and the like. The DNA 
sequence encoding the gene of interest also may be 
manipulated to remove potentially inhibiting sequences 
or to minimize unwanted secondary structure formation. 
The resulting synthetic genes can be expressed in 
appropriate prokaryotic hosts such as various strains 
of E. coli, or in eucaryotic hosts such as Chinese 
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hamster ovary cells (CHO), mouse myeloma, hybridoma, 
transfectoma, and human myeloma cells. The currently 
preferred expression system for the present invention 
is coli , as disclosed herein. 

When the gene is to be expressed in coli, it 
is cloned into an expression vector downstream of a 
strong promoter sequence, such as Trp or Tac, and 
optionally also may include a gene coding for a leader 
polypeptide, such as the fragment B (FB) of 
staphylococcal protein A. The resulting fusion 
protein, when expressed, accumulates in refractile 
bodies (also known as inclusion bodies) in the 
cytoplasm, and may be harvested after disruption of the 
cells by French press or sonication. The proteins then 
are solubilized, and refolded in vitro , as described 
herein. Where the construct is . engineered as a fusion 
protein, the protein is solubilized and the leader 
sequence preferably cleaved before renaturation. The 
cleavage site for the leader sequence preferably is 
immediately adjacent to the sFv polypeptide chain and 
includes one amino acid or a sequence of amino acids 
exclusive of any one amino acid or amino acid sequence 
found in the amino acid structure of the single 
polypeptide chain. 

The cleavage site preferably is designed for 
specific cleavage by a selected agent. Endopeptidases 
are preferred, although non-enzymatic (e.g., chemical) 
cleavage agents may be used. Many useful cleavage 
agents, for instance, cyanogen bromide (CNBr), dilute 
acid, trypsin, Staphylococcus aureus V- 8 protease, 
post-proline cleaving enzyme, blood coagulation 
Factor Xa, enterokinase, and renin, recognize and 
preferentially or exclusively cleave at particular 
cleavage sites. One currently preferred peptide 
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sequence cleavage agent is V-8 protease. The currently 
preferred cleavage site is at a Glu residue. Other 
useful enzymes recognize multiple residues as a 
cleavage site, e.g., factor Xa ( Ile-Glu-Gly-Arg) or 
enterokinase ( Asp-Asp-Asp-Asp-Lys ) . Dilute acid 
preferentially cleaves the peptide bond between Asp-Pro 
residues, and CNBr in acid cleaves after Met, unless it 
is followed by Tyr. 

Alternatively, the engineered gene may be 
incorporated into a vector without a sequence encoding 
a leader polypeptide, and the engineered gene expressed 
to produce a polypeptide chain that is secreted into 
the E. coli periplasmic space. The secreted protein 
then can be isolated and, optionally, purified further 
using standard methodologies. (See, for example, Pack 
et al. (1992) Biochem 31 ; 1579-1584 . ) 

If the engineered gene is to be expressed in 
eucarybtic hybridoma cells, the conventional expression 
host for immunoglobulins, the gene preferably is 
inserted into an expression vector containing, for 
example, the immunoglobulin promoter, a secretion 
signal, and immunoglobulin enhancers. This plasmid 
also may contain sequences encoding other polypeptide 
chains, including part or all of a toxin, enzyme, 
cytokine, or hormone. The gene then is trans fected 
into myeloma cells via established electroporation or 
protoplast fusion methods. The transfected cells then 
may express V H -linker-V L -tail or- V L -liftker-V H -tail 
single-chain FV polypeptide chains. 

The sFv polypeptide chains can be expressed as 
either inactive or active polypeptide chains. 
Spontaneously refolded sFv polypeptide chains can be 
obtained from either prokaryotic or eukaryotic 
expression systems when the polypeptide chains are 
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secreted for instance, either into the coli 
periplasraic space or the mammalian cell culture medium. 
These spontaneously refolded polypeptide chains readily 
can be purified by affinity chromatography. Where the 
sFv polypeptide chains are obtained in an unfolded, 
inactive sFv form, for instance, when overexpression of 
the sFv polypeptide chain in coli results in the 
formation of inclusion bodies, the proteins can be 
refolded in vitro . Briefly, inclusion bodies are 
harvested by centrifugation, the sFv, solubilized with 
denaturants such as guanidine hydrochloride (GuHCl) or 
urea, and then refolded by dilution of the denaturant 
under appropriate redox (reduction/oxidation) 
conditions (see below). The refolded sFv polypeptide 
chains then can be purified by affinity chromatography. 
Details for the isolation of inclusion bodies, 
solubilization and renaturation of the sFv polypeptide 
chains are well known in the art (see for example, US 
Patent 5,091,513 and Huston et al. , 1988, supra). 

Dimerization and Purification of the sFv Polypeptides 

The sFV monomers of the present invention can be 
dimerized in vivo or in vitro , in the in vivo - 
approach, two sFv' genes can be cotransf ected into the 
host cell wherein the coexpressed sFv' polypeptide 
chains spontaneously dimerize. Alternatively, the 
refolded, secreted sFv' polypeptide chain monomers can 
be isolated from two expression hosts and subsequently 
dimerized in vitro . 

In a preferred aspect of the invention, the sFv' 
polypeptide chains comprising a single cysteine C- 
terminal tail residue are expressed in coli and form 
inclusion bodies. The resulting sFV polypeptide 
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chains are solubilized with denaturants and renatured 
in vitro , either in the presence or absence of 
exogenously added glutathione. Surprisingly, the 
additional C-terminal cysteine residues apparently do 
not interfere with the refolding process. In some 
cases however, sFv and sFV constructs may refold 
poorly in vitro . These constructs can be "preoxidized 
prior" to refolding as taught in Huston et al. , .( 1991 ) 
Meth. Enzymol . 203:46-88, or, alternatively, the 
polypeptide chains can be secreted across a membrane 
bilayer. The latter process spontaneously separates 
the extra C-terrainal cysteine residue from the cysteine 
residues normally found in the Fv domain, minimizing 
inappropriate disulfide bond formation. Secretion is 
the preferred method if the sFV constructs refold 
poorly in vitro . 

Following renaturation of the sFv' monomers, 
(sFv') 2 dimers readily can be prepared in vitro by air 
oxidation if cysteine amino acids are present in the 
C-terrainal tail sequences. Alternatively, sulfhydryl 
specific crosslinking reagents, for instance, the BMH 
crosslinker or the MCA-peptide-MCA bridge may be used 
to covalently couple two sFv' chains. The resultant 
homo or heterodimers , then can be purified by standard 
size exclusion chromatography. However, when (sFv) 2 
heterodimers are required, then a preferred 
purification protocol uses a sequential two step 
affinity chromatography procedure. Briefly, the 
heterodimer is exposed to a first chromatographic 
system having an epitope that interacts specifically 
with one sFv of the heterodimer. The eluant containing 
the heterodimer is then exposed to a second system 
having an epitope that interacts specifically with the 
other sFv. For specific details of the dimerization 
and purification procedures, see Example 2. 
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Considerations for In Vivo Administration. 

The dimeric constructs may be administered either 
by intravenous or intramuscular injection. Effective 
dosages for the single-chain Fv constructs in antitumor 
therapies or in effective tumor imaging can be 
determined by routine experimentation, keeping in mind 
the objective of the treatment. 

The pharmaceutical forms suitable for injection 
include sterile aqueous solutions or dispersions. In 
all cases, the form must be sterile and must be fluid 
so as to be easily administered by syringe. It must be 
stable under the conditions of manufacture and storage, 
and must be preserved against the contaminating action 
of' microorganisms. This may, for example, be achieved 
by filtration through a sterile 0.22 micron filter 
and/or lyophilization followed by sterilization with a 

gamma ray source. 

Sterile injectable solutions are prepared by 
incorporating the desirable amount of the constructs, 
disclosed herein, into an appropriate solvent, such as 
sodium phosphate-buffered saline (PBS), followed by 
filter sterilization. As used herein, "a 
physiologically acceptable carrier" includes any and 
all solvents, dispersion media, antibacterial and 
antifungal agents that are non-toxic to humans, and the 
like. The use of such media and agents as 
pharmaceutical^ active substances are well known in 
the art. The media or agent must be compatible with 
maintenance of proper conformation of the single-chain 
Fv polypeptide chains, and its use in the therapeutic 
compositions. Supplementary active ingredients can 
also be incorporated into the compositions. 
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A preferred remotely detectable moiety for in 
vivo imaging includes the radioactive atom 
Technetium" 9 9 m ( 99ra Tc), a gamma emitter with a half- 
life of about 6 hours- Non- radioactive moieties also 
useful in imaging include nitroxide spin labels as well 
as lanthanide and transition metal ions all of which 
induce proton relaxation in situ . In addition to 
immunoimaging, the complexed radioactive moieties may 
be used in standard radio immunotherapy protocols to 
destroy the targeted cell. Preferred nucleotides for 
high dose radio immunotherapy include the radioactive 
S atoms 9 5 Yttrium ( 90 Yt), 13 iodine ( 131 I) and 11 indium 

( 1X1 In). 

Either the single polypeptide chain sFV itself, 

or the spacer groups for linking the sFV constructs 

.. v. ni T in Tn 

can be labeled with radioisotopes such as I, in 

and" 99m Tc. 99m Tc and 1 1 1 In are preferred because they 

can be detected with gamma cameras and have favorable 

half-lives for in vivo imaging applications. The 

single polypeptide chains can be labeled, for example, 

with radioactive atoms such as 9 °Ty, 99M Tc or I via 

a conjugated metal chelator (see, e.g., Khaw et al. 

,1980, Science 209: 295; U.S. Patent No. 4,472,509; 

U.S. Patent No. 4,479,930), or by other standard means 

of linking isotopes to proteins, known to those with 

skill in the art (see for example, Thankur et al . , 

1991, J. Immunol . Methods 237 : 217-224). 
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The invention is illustrated by the following 
Examples, which are not intended to be limiting in any 
way . 



EXAMPLES 



Example 1. Synthesis and Expression of the sFv 
Constructs (741F8, 26-10 and 520C9) . 

The construction of several sFv genes using 
different but standard recombinant DNA technology, well 
known to those having ordinary skill in the art, is 
described below. These procedures include the 
amplification of the V R and V L gene sequences by PCR, 
the ligation of appropriate synthetic DNA duplexes and 
the cloning of V H or V L genes by colony hybridization. 

A. 741F8 sFv' . 

The V R and V L genes of the 741F8 anti-c-erbB-2 
monoclonal antibody were isolated from the cDNA of the 
parental 741F8 hybridoma line by PCR using primers 
homologous to the N-terminal coding regions of V fl , V L , 
C H 1, and C L . The PCR-amplif ied V fl and V L genes were 
isolated by polyacrylamide gel electrophoresis and 
cloned into a pUC cloning vector. The first FR region 
of the 741F8 V„ gene however contained spurious 
mutations due to the PCR procedure. Errors were 
rectified by the replacement of the first 70 
nucleotides of 741F8 V fl with a similar sequence from 
520C9 V„, another c-erbB-2 specific monoclonal 
antibody. 

Restriction sites then were introduced into the 
ends of the heavy and light chain variable gene 
segments by site-directed mutagenesis (Kunkel et al. , 
1985, Proc. Natl. Acad. Sci. USA 82: 488-492). A Nco I 
site encoding methionine was positioned at the N- 



terminus of V fl for expression in coli. A Sac I site 
was created at the 3' end of V R gene. A Xho I site, 
together with an adjacent Eco RV site, were created at 
the N-terminus of V L - A stop codon and a Pst I site 
were placed at the C-terminal end of V L< 

The single-chain Fv gene was constructed by 
connecting the V H and V L genes together with a DNA 
sequence encoding the 14 residue polypeptide linker, 
(Ser 4 Gly) 2 Ser 4 , as set forth as amino acids 122 through 
135 in the Sequence Listing as SEQ. ID. NOS.: 1 and 2. 

A synthetic DNA duplex encoding the C-terminal 
amino acid sequence, (Gly) 4 ~Cys was inserted into a Hpa 
I site located near the stop codon at the 3' end of the 
741F8 sFv gene. The resulting 741F8 anti-c-erbB-2 sFv' 
gene was excised from the pUC cloning vector, with the 
restriction enzymes Nco I and Bam HI (a Bam HI site is 
located 3' to the C-terminal Pst I site), and inserted 
into the same sites of a commercial T7 expression 
vector pET-3d ( In-vitrogen, Inc.). The resulting gene, 
set forth in the Sequence Listing as SEQ. ID. NOS.: 1 
and 2, was transformed into E^ coli BL21-DE (In- 
vitrogen, Inc.). Protein expression was induced by the 
addition of IPTG to the culture medium. 

B. 26-10 sFv' 

Construction of the anti-digoxin 26-10 sFv has 
been described previously (Huston et al., 1988, Proc. 
Natl . Acad . Sci. USA 85; 5879-5883, and US Patent 
5,091,513, both of which are hereby incorporated by 
reference). Briefly, the synthetic gene was 
constructed by ligating multiple synthetic DNA duplexes 
together. The C-terminal DNA duplex coding for the 
amino acid sequence (Gly).-Cys subsequently was ligated 



into a Hpa I restriction site close to the 3' end of 
the 26-10 sFv gene. The resulting sFv' gene, set forth 
in- the Sequence Listing as SEQ. ID. NOS. : 3 and 4, was 
then inserted into the coli expression vector pET- 
3d. This plasmid was subsequently transformed into E^ 
coli BL21-DE (In-vitrogen, Inc.) and protein expression 
induced by the addition of IPTG to the culture medium. 

C. 520C9 sFv . 

The 520C9 sFv was generated by linking together 

the V and V T genes, cloned from a 520C9 hybridoma cDNA 
H L 

l-ibrary, with a serine rich linker. Briefly, the V H 
and V T genes were cloned from the 520C9 hybridoma cDNA 

library using probes directed toward the antibody 
constant (C) and joining (J) regions- Appropriate 
restriction sites were introduced at the ends of each 
gene by site-directed mutagenesis (Kunkel et al . , 1985, 
Froc. Natl . Acad. Sci. USA 82: 488-492). The V R and V L 
genes were then ligated together with a serine rich 
linker./) The resulting 520C9 sFv gene, set forth in the 



Sequence Listing as SEQ. ID. NOS.: 5 and 6, was 
transformed into the E^ coli expression vector and 
expressed as described above and in co-pending U-SSN 
831,967, incorporated therein by reference. 

EXAMPLE 2. Renaturation, Dimerization and Puri fication 
of sFv Proteins. 

A. Renaturation and Purification of sFv monomers . 

Protocols for renaturing sFv monomers derived 
from E^ coli inclusion bodies are described below. In 
separate experiments the 7418, 26-10 and 520C9 sFv 
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polypeptides were expressed in E. coli . The unfolded 
sFv proteins were solubilized from inclusion bodies and 
refolded under appropriate redox conditions. The 
refolded sFv polypeptide chains were purified by 
affinity chromatography or by a combination of ion- 
exchange and size exclusion chromatography when 
affinity chromatography was not feasible or expedient. 

Renaturation of 741F8 sFv' . 

Inclusion bodies containing the 741F8 sFV 
proteins were washed in a buffer containing 25 mM Tris, 
10 mM EDTA, 1.5 M GuHCl, pH 8 . 0 and solubilized in 25 
mM Tris, 10 mM EDTA, 7 M GuHCl, pH 9.0 to an OD 280 nm 
of about 25-50. The sample was reduced overnight at 
room temperature by the addition of dithiothreitol 
(DTT) to a final concentration of 10 mM. The thiol 
groups were converted into mixed disulfides with 
glutathione by the addition of solid oxidized 
glutathione to a final concentration of 100 mM. The 
solution was adjusted to pH 9.0 and incubated for 4 hr 
at room temperature. The 741F8 sFv' polypeptide chains 
then were refolded in vitro to generate stable monomers 
with their C-terminal cysteines remaining blocked with 
glutathione. The 741F8* sFv' mixed disulfide 
preparation was diluted to an OD og0 of about 0.15 by 
the addition of 10 mM Tris, 4 mM EDTA, 6 M urea, pH 8.5 
at 4°C. After two hours an equal volume of lOmM Tris, 
4 mM EDTA, 1 mM reduced glutathione, pH 8.5, precooled 
to 4°C, was added with rapid mixing to reduce the urea 
concentration to 3 M. After dilution, the samples were 
allowed to renature for 7 2 hr at 4°C. 



Renaturation of 26-10 sFv' . 



Inclusion bodies containing the 26-10 sFv' 
proteins were washed with 25 mM Tris, 10 mM EDTA and 
solubilized in 6 M GuHCl, 25 mM Tris, 10 mM EDTA, pH 
8.7 to an OD OQn m of about 10 to 20. The dissolved 
proteins were reduced by overnight incubation at room 
temperature after the addition of DTT to 10 mM. The 
reduced protein could also be blocked with oxidized 
glutathione as noted above for the 741F8 sFv' 
polypeptide. The reduced, denatured 26-10 sFV 
polypeptides were refolded in a manner similar to that 
for the 741F8 sFv' by diluting the preparation into a 
buffer containing 3 M urea, 0.1 mM oxidized and 0.01 
mM reduced glutathione to give a final protein 
concentration of about 0.15 mg/ml. After overnight 
incubation at 4°C, the mixture was dialyzed against PBS 
containing 0.05 M KH 2 P0 4 , 0.15 M NaCl, pH 7 for two 
days at 4°C. 

Renaturation of 520C9 sFv. 

The inclusion bodies containing the 520C9 sFv 
were washed with 25 mM Tris, 10 mM EDTA, pH 8.0„ 1 M 
GuHCl and solubilized in 25 mM Tris, 10 mM EDTA, 6 M 
GuHCl, 10 mM dithiothreitol (DTT), pH 9.0. The 
material was ethanol precipitated and resuspended in 25 
mM Tris, 10 mM EDTA, 6M urea, 10 mM DTT, pH 8.0 and 
fractionated by ion exchange chromatography to remove 
contaminating nucleic acids and E*_ coli proteins before 
renaturation of the sFv. The material that did not 
bind to a DEAE Sepharose Fast Flow (FF) column was 
precipitated by lowering the pH to 5.5 with 1 M acetic 
acid. The pellet was resolubilized in 25 mM Tris, 10 
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mM EDTA, 6 m GuHCl, 10 niM DTT, pH 9.0 and oxidized by 
overnight incubation at room temperature following 
dilution into a buffer containing 25 mM Tris, 10 mM 
EDTA 6 M GuHCl, 1 mM oxidized glutathione, 0.1 mM 
reduced glutathione, pH 9.0. After overnight oxidation 
the sample was dialyzed against 10 mM NaH 2 P0 4 , 1 mM 
EDTA, 150 mM NaCl, 500 mM urea, pH 8.0 and the sample 
clarified by filtration through a membrane with a 100 
kD mol. wt. cut-off prior to purification on a c-erbB-2 
affinity column. 

Purification of the refolded sFv Polypeptides . 

The refolded 26-10 sFV polypeptide chains were 
purified by ouabain-Sepharose affinity chromatography, 
as described for the 26-10 sFv constructs (Huston, et. 
al. , 1988, Proc. Natl Acad. Sci. USA 85; 5879-5883 and 
Tai, et al. , 1990, Biochem. 29, 8024-3080, both of 
which are hereby incorporated by reference ) . The 
refolded 520C9 sFv polypeptide chain was similarly 
purified using a c-erbB-2-agarose affinity column. In 
this case, the refolded samples were loaded onto a c- 
erbB-2 affinity column, the column washed with PBS, and 
the 520C9 sFv polypeptides eluted with PBS pH 6U. 
containing 3 M LiCl. The buffer was then exchanged by 
dialysis. The c-erbB-2 affinity column preferably was 
prepared by linking the extracellular domain of c-erbB- 
2 onto agarose beads . 

Briefly, the c-erbB-2 sequence coding for its 
extracellular domain (ECD) was derived from the 
baculovirus expression vector described previously 
(Ring et al . , 1992, Mol. Immunol . 28; 915-917). A DNA 
duplex encoding the His g peptide was ligated to the 3' 
end of the ECD gene, and the construct expressed in CHO 
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cells. The ECD polypeptide was purified from the CHO 
cell culture medium on an IMAC metal affinity column 
(Pharmacia, Piscataway, NJ), as described in Skerra, et 
al., 1991, Bio/Technology 9: 273-278, and the eluted 
ECD proteins attached onto agarose beads to generate 
the c-erbB-2-agarose affinity resin. 

The renatured 741F8 sFv' polypeptides were 
purified by a combination of ion exchange, and size 
exclusion chromatography. Briefly, the renatured 741F8 
sFv' preparation was passed through a DEAE-cellulose 
column and the 741F8 sFv' in the unbound fraction 
adjusted to pH 5.0 before loading on an S-Sepharose FF 
column. The 741F8 sFv' polypeptide chains were eluted 
with PBS containing 2 mM EDTA and 3 M urea, and 
dialyzed against 10 mM Tris, 2 mM EDTA, 20 mM NaCl/pti 
7.5 at 20° C. The precipitate was harvested by 
centrifugation, dissolved in a suitable buffer, and 
passed through a Q-Sepharose FF column. The unbound 
material was adjusted to pH 5.5 and reloaded onto a S- 
Sepharose FF column. The 741F8 sFV polypeptides were 
eluted with a PBS, 2 mM EDTA, 100 mM NaCl, 3 M urea 
buffer and dialyzed against PBS, 2 mM EDTA. The 
precipitate was harvested again by centrifugation, 
dissolved in a suitable buffer, sucrose added ta 5% 
(w/v), and the 741F8 sFv' concentrated to 5 mg/ml in a 
YM10 membrane concentrator (Amicon) . The 741F8 sFV 
polypeptide chains were fractionated by gel filtration 
chromatography using a S-200 HR column (Pharmacia LKB 
Biotechnology) and a PBS, 2 mM EDTA buffer. 

B. Dimerization of the sFv' constructs 

Dimerization of sFv' monomers can be induced 
using standard crosslinking conditions. Where 
disulfide bond formation is desired, the monovalent 
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sFv' polypeptide chains initially are deblocked by mild 
reduction and (sFv') 2 dimers formed by crosslinking the 
sFv' polypeptides either by disulfide linkages or by 
thioether linkages with the BMH or MCA-peptide-MCA 
crosslinking reagents. 

In order to generate disulfide linked constructs 
the purified 741F8 and 26-10 sFV preparations were 
dialyzed against 50 mM Tris, 150 mM NaCl, pH 8.5. The 
C-terminal glutathionyl blocking groups were removed by 
the addition DTT to a concentration of 2 mM followed by 
overnight incubation at room temperature. Excess 
reducing agent was removed by extensive dialysis 
against 50 mM Tris, 150 mM NaCl, pH 8.5, during which 
the majority of the sFV polypeptides oxidized into the 
homodimeric form. 

In order to generate BMH and MCA-peptide-MCA 
crosslinked constructs, sFv' polypeptide chains in PBS 
first were reduced for two hours at room temperature by 
the addition of DTT to a final concentration of 1 mM. 
The samples were desalted by gel filtration 
chromatography using a PBS, 1 mM EDTA buffer. A 4-5 
fold molar excess of either the BMH or MCA-peptide-MCA 
linkers, both dissolved in dimethylsulf oxide, were 
added to the reduced protein and incubated for at least 
12 hours at room temperature. The resulting dimers 
were then purified by HPLC gel filtration 
chromatography . 

A modification of the procedure of Brennan, et 
al. (1985, Science 229: 81-83) may be used to generate 
disulfide linked sFv' heterodimers . For example, in 
order to link the 741F8 and 26-10 sFV polypeptides a 
thionitrobenzoate (TNB) derivative of the 26-10 sFv' 
(26-10 sFv'-TNB) was mixed with mildly reduced 741F8 
sFv' . The 26-10 sFv'-TNB was prepared by reducing the 



26-10 sFv' in PBS with 15 mM 2-raercaptoethylamine for 
30 minutes at room temperature. The reducing agent was 
removed by gel filtration and the reduced 26-10 sFV 
reacted with 2.2 mM dithionitrobenzoate (DTNB) for 3 
hours. The active 26-10 sFv'-TNB was adsorbed onto 
onto ouabain-Sepharose. The glutathionyi blocked 
741F8 sFv' monomer in 25 mM Tris, 150 mM NaCl, pH 8.2 
was reduced for 2 hours at room temperature by the 
addition of DTT to a final concentration of 1 mM. The 
excess DTT was removed by gel filtration and the 
reduced 741F8 sFv' reacted overnight at room 
temperature with the 26-10 sFv'-TNB complexed to 
ouabain-Sepharose. The progress of the reaction was 
monitored spectroscopically at 412 nm, the absorbance 
maximum of the TNB anion. 

C; Purification of (sFv') 2 Dimers. 

The (sFv') 2 homodimers may be separated from the 
sFv' monomers by gel filtration chromatography. 
Following dimerization, the sFV preparations are 
dialyzed against PBS containing 1 mM EDTA, 3M urea, 
0,03% azide, to disrupt any non-covalent homodimers and 
fractionated by HPLC on a TSK-G20000SW column using the 
same buffer. The procedure requires two passes for 
purification of the (sFv') 2 homodimers to homogeneity. 
The purified homodimers may be dialyzed either against 
PBS or any other suitable buffer prior to use. 

The (sFv') 2 heterodimers can be separated by a 
two step affinity chromatography procedure taking 
advantage of the bivalent nature of the dimer. For 
instance, during the the purification of the 741F8/26- 
10 heterodimer the mixture initially was loaded onto an 
ouabain-Sepharose column, washed with a PBS, 1 M NaCl 



buffer, to remove any non-specif ically adsorbed 
material, and rewashed with PBS to reduce the salt 
concentration. The reactive 26-10 sFV species bound 
to the resin were eluted with 20 mM ouabain in PBS and 
the eluate dialyzed against PBS to remove the cardiac 
glycoside. The 741F8/26-10 heterodimers were then 
repurified on a c-erbB-2-agarose affinity column taking 
advantage of the ECD binding site in the heterodimer. 
After the preparation was loaded onto the c-erbB-2 
affinity column, it is washed with PBS and the (sFv') 2 
heterodimer eluted with 25 mM Tris, 10 mM EDTA, 5M 
LiCl, pH 6.8. Prior to use, the buffer was exchanged 
with PBS by dialysis. 

EXAMPLE 3. Immunoreactivity of th e Monomeric and 
Dimeric sFv Polypeptides^ 

A. Radiolabeling of the sFv' Constructs. 

The sFv' polypeptides may be labeled by the 
chloramine-T method as described (DeNardo, et al . , 
1986, Nucl. Med. Biol^, 13; 303-310). Briefly, 1.0-2.0 
mg of sFV was combined with 125 I [14-17 mCi/pg] 
(Amershara, Arlington Heights, IL) at an iodine to 
protein ratio of 1 : 10 in a 12 x 75 mm plastic test 
tube. 10^1 [1 mg/ml] of chloramine-T (Sigma, St. 
Louis, MO) per 10C fjg of protein was added and the 
mixture incubated for three minutes at room 
temperature. After the reaction was terminated, 
unincorporated 12 5 1 was separated from the labeled sFV 
by the spun-column method of Meares, et al . , 1984, 
Anal. Biochem. 142: 68-78. Specific activities of 0.2- 
1.0 mCi/mg for the l25 I-labeled products may be 
routinely obtained. 
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B. Competition ELISA 

in order to prepare c-erbB-2, SK-Br-3 breast 
cancer cells (Ring etal., 1989, Cancer Res .. 49: 3070- 
3080), were harvested and resuspended in lOmM NaCl, 
0.5% Nonidet-P40, P H 8. Insoluble debris was removed 
by centrifugation and the extract filtered through 0.4 5 
Millex HA and 0.2 Millex GV filters. 40/jl of the 
extract was added to each well of a 96 well plate and 
incubated overnight at 37 °C. The plates then were 
washed with PBS and non-specific binding sites blocked 
following the addition of PBS containing 1% skim milk 
by incubation for one hour at room temperature. The 
sFv and 520C9 Fab samples, diluted in PBS, were added 
to the wells and incubated for 30 mins at room 
temperature. A control containing only dilution buffer 

was also included. 

in order to quantitate the reaction, 20pl of a 
520C9-horseradish peroxidase ( HRP ) probe (Zymed Labs., 
South San Francisco, California), diluted to 14pl/ml m 
PBS containing 1% skim milk, was added to each well and 
incubated for one hour at room temperature. The plate 
was then washed four times with PBS, the peroxidase 
substrate added and incubated for 30 minutes at room 
temperature. The reaction was quenched with H 2 S0 4 and 
the OD 150mn values measured. 

FIG. 3 compares the binding ability of the 
parental 520C9 Fab fragment, together with the 520C9 
sFv single-chain binding protein. The 520C9 sFv 
samples included the material obtained following 
renaturation of the polypeptide in vitro, a sample 
purified on a c-erbB-2 agarose affinity column, and the 
material that did not bind to the column. The fully 



purified 520C9 sFv polypeptide exhibits an affinity for 
c-erbB-2 indistinguishable from the parent 520C9 Fab 
fragment. 

C. Biodistribution Studies. 

In vivo immunotargeting tissue imaging studies 
were performed using standard procedures. 
Approximately 2.5 x 10 6 SK-OV-3 cells (a human ovarian 
cancer cell line that expresses c-erbB-2 on the cell 
surface) in log phase were implanted subcutaneous ly 
onto the hips of four to six week old C ,Bl7/ICI-scid 
mice. Three days after Lugol's solution was placed in 
the drinking water to block the accumulation of 
radio iodine in the thyroid, the mice were used in the 
biodistribution assays. 

The radiolabeled sFv' and Fab preparations were 
diluted in PBS for these studies. The biodistribution 
of the glutathionyl-blocked 741F8 sFV monomers, and 
the 741F8 and 26-10 (sFv') 2 constructs were compared 
after identical doses of the radiolabeled protein was 
administered by injection in each case. The total 
injected doses were determined by counting each animal 
on a Series 30 multichannel analyzer/probe system 
(probe model #2007, Canaberra, Meridian, CT) . Groups 
of 3-6 mice were sacrificed twenty four hours after 
injection, the tumors and organs were removed, weighed 
and counted in a gamma counter to determine the amount 
of radiolabel incorporated into the tissues. From 
these measurements, the percentage of the initial 
injected dose incorporated per gram of tissue 
(% ID/gram) or the amount of label incorporated into the 
tumor relative to the amount of radiolabel incorporated 
into the other organs (T:0 ratio) were determined. For 



specific details see DeNardo, et al. , 1977, Cancer , 4_0: 
2923-2929, or Adams, et al., 1992, Antibody , 
Immunoconjugates, and Radiopharmaceuticals !5: 81-95, 
both of which are hereby incorporated by reference. 
Specificity indices also can be determined by dividing 
the T:0 ratios of the 125 I-741F8 sFV by the 
corresponding T:0 ratios of the 125 I-26-10 sFV . The 
results of the biodistribution studies 24 hours post 
administration are summarized in Figs. 4 and 5. The 
mean standard error (SEM) for each value is less than 
30%, except where indicated. 

The disulfide linked 741F8 (sFv') 2 homodimers 
exhibit identical tumor specificities when compared to 
the monomeric 741F8 sFv' polypeptide chains. The T:0 
ratios of the 741F8 sFv' constructs consistently exceed 
those for the 26-10 sFv' constructs, demonstrating the 
binding specificity of the 741F8 constructs for the 
tumors (FIG. 4). In addition, the 741F8 (sFv') 2 dimers 
generally exhibit higher T:0 ratios relative to that of 
the monomeric species, particularly for the disulfide 
bonded sFV 741F8 ( sFv' - (Gly ) 4 Cys ) 2 and the MCA linked 
741F8 (sFv') 2 homodimers. In addition, the 741F8 
(sFv') 2 homodimers localize in greater amounts in the 
tumors relative to the monomeric sFv' species (£IG- 5). 

12 5 

In a separate comparative study with I-labeled 

12 5 

26-10 (sFv') 2 and the following species of I-labeled 
741F8: sFv f monomers, Fab, disulfide linked (sFv'- 
Gly 4 Cys) 2 homodimers, and MCA- and BMH-linked (sFv') 2 
homodimers, the in vivo tumor localization properties 
of these molecules were compared (%ID/gram tumor 
tissue, see Fig. 6). As is evident from the figure, 
the tumor localization properties of all of the dimeric 
741F8 (sFv') 2 constructs are significantly greater than 
those observed with the 741F8 Fab, the 741F8 sFV 



monomer and the 26-10 (sFv') 2 dimer (FIG. 6). The 
results demonstrate that the increased apparent avidity 
and enhanced in vivo imaging of the (sFv') 2 dimer is 
due, at least in part, to its improved retention in 
tumor tissue. 

EMBODIMENTS 



The invention may be embodied in other specific 
forms without departing from the spirit or essential 
characteristics thereof. The present embodiments are 
therefore to be considered in all respects as 
illustrative and not restrictive, the scope of the 
invention being indicated by the appended claims rather 
than by the foregoing description, and all changes 
which come within the meaning and range of equivalency 
of the claims are therefore intended to be embraced 
therein. 
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SEQUENCE LISTING 

1) GENERAL INFORMATION: 

(i) APPLICANT: Huston, James S. 

\ Houston, L. L. 
\Ring, David B. 
Oppermann, Hermann 

(iij TITLE OF INVENTION: Biosynthetic Binding Proteins For 
Imaging 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE \aDDRESS : hpault/Patent Department 

(A) ADDRESSEE: \Testa, Hurwitz & Thibeault/ patent ue V 

(B) STREET: Exchange Place, 53 State Street 

(C) CITY: Bostoft 

(D) STATE: Massachusetts 

(E) COUNTRY: USA\ 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM EC compatible 
C OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patenkn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER - 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATIC 

(A) NAME: Kelley, Robin 

(B) REGISTRATION NUMBER: 34,637 

(C) REFERENCE/DOCKET NUMBER: 2054/22 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-248-7477 x 

(B) TELEFAX: 617-248-7100 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE :\ 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.-752 
(D) OTHER INFORMATION: 

r4-Cys" 



/product= "741F8 sFv' C- terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CC ATG GCG GAG ATC CAa\ TTG GTG CAG TCT GGA CCT GAG CTG AAG AAG 
Met Ala Glu He Gln\Leu Val Gin Ser Gly Pro Glu Leu Lys Lys 
1 5\ 10 15 

CCT GGA GAG ACA GTC AAC! ATC TCC TGC AAG GCT TCT GGG TAT ACC TTC 
Pro Gly Glu Thr Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe 
20 1 25 30 



ACA AAC TAT GGA ATG AAC 
Thr Asn Tyr Gly Met Asn 
35 



_ GTG AAG CAG GCT CCA GGA AAG GGT TTA 
rrp Val Lys Gin Ala Pro Gly Lys Gly Leu 
1 40 45 



AAG TGG ATG GGC TGG ATA 
Lys Trp Met Gly Trp He Asr 
50 

GAA GAG TTC AAG GGA CGG TTT 
Glu Glu Phe Lys Gly Arg Phe 
65 70 



ACC AAC ACT GGA GAG CCA ACA TAT GCT 
Thr Asn Thr Gly Glu Pro Thr Tyr Ala 
55 60 

SCC TTC TCT TTG GAA ACC TCT GCC AGC 
la Phe Ser Leu Glu Thr Ser Ala Ser 
75 



ACT GCC TAT TTG CAG ATC AAC AA? 
Thr Ala Tyr Leu Gin He Asn Asi 
80 85 

TAT TTC TGT GGA AGG CAA TTT ATT 
Tyr Phe Cys Gly Arg Gin Phe He 
100 



CTC AAA AAT GAG GAC ACG GCT ACA 
Leu Lys Asn Glu Asp Thr Ala Thr 
90 '95 

\CC TAC GGC GGG TTT GCT AAC TGG 
Ehr Tyr Gly Gly Phe Ala Asn Trp 
105 HO 



GGC CAA GGG ACT CTG GTC ACT GTC TCT GCA TCG AGC TCC TCC GGA TCT 
Gly Gin Gly Thr Leu Val Thr Val Sei Ala Ser Ser Ser Ser Gly Ser 
115 12ffi 125 



TCA TCT AGC GGT TCC AGC TCG AGC GAT 
Ser Ser Ser Gly Ser Ser Ser Ser Asp 
130 135 



kTC GTC ATG ACC CAG TCT CCT 
He Val Met Thr Gin Ser Pro 
140 



AAA TTC ATG TCC ACG TCA GTG GGA GAC AffiG GTC AGC ATC TCC TGC AAG 
Lys Phe Met Ser Thr Ser. Val Gly Asp Aife Val Ser He Ser Cys Lys 
145 150 \ 155 



47 



95 



143 



191 



239 



287 



335 



383 



431 



479 
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GCC AGT CAG GAT GTG AGT ACT GCT GTA GCC TGG 
Ala Ser Gin Asp Val Ser Thr Ala Val Ala Trp 
160 155 170 



GGG CAA TCT CCT AAA CTX CTG ATT TAC TGG ACA 
Gly Gin Ser Pro Lys Leu\Leu He Tyr Trp Thr 
180 \ 185 

GGA GTC CCT GAT CGC TTC A^CA GGC AGT GGA TCT 
Gly Val Pro Asp Arg Phe Th\r Gly Ser Gly Ser 
195 \ 200 

CTC ACC ATC AGC AGT GTG CAG^GCT GAA GAC CTG 
Leu Thr He Ser Ser Val Gin Ala Glu Asp Leu 
210 2 

CAG CAA CAT TAT AGA GTG CCG TAfc ACG TTC GGA 
Gin Gin His Tyr Arg Val Pro Tyr\ Thr Phe Gly 
225 230 

GAG ATA AAA CGG GCT GAT GGG GGA iSGT GGA TGT 
Glu He Lys Arg Ala Asp Gly Gly Gly Gly Cys 
240 245 \ 250 

GGGTCTCGTT ACGTTGCGGA TCTCGAGGCT A^CTTTACTA 

GCTCAACTGT CTGCACGCAA GCTTTTGCAG GA^ATCATGA 

TGCAGGCATG CAAGCTT 



TAT CAA CAA AAA CCA 
Tyr Gin Gin Lys Pro 
175 

TCC ACC CGG CAC ACT- 
Ser Thr Arg His Thr 
190 

GGG ACA GAT TAT ACT 
Gly Thr Asp Tyr Thr 
205 

GCA CTT CAT TAC TGT 
Ala Leu His Tyr Cys 
220 

GGG GGG ACC AAG CTG 
Gly Gly Thr Lys Leu 
235 

TAACGGGGGA GGTGGATGTT 



ACTCTTACCG TAAAGTTCTG 
GCGCTTAAGA TCCGTCGACC 



527 

575 

623 

671 

719 

772 

832 
892 
909 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: \: 

Met Ala Glu He Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro 
1 5 10 \ 15 

Gly Glu Thr Val Lys He Ser Cys Lys Ala Ser GlW Tyr Thr Phe Thr 
20 25 \ 30 

Asn Tyr Gly Met Asn Trp Val Lys Gin Ala Pro Gly\ Lys Gly Leu Lys 
35 40 \ 45 
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Trp Met Gly Tr\ lie Asn Thr Asn Thr Gly 
50 \ 55 

Glu Phe Lys Gly ArV Phe Ala Phe Ser Leu 
65 \ 70 

Ala Tyr Leu Gin He A\n Asn Leu Lys Asn 
85 \ 90 

Phe Cys Gly Arg Gin Phe \le Thr Tyr Gly 
100 \ 105 

Gin Gly Thr Leu Val Thr Val\Ser Ala Ser 
115 \20 

Ser Ser Gly Ser Ser Ser Ser As^p He Val 
130 135 

Phe Met Ser Thr Ser Val Gly Asp Arg Val 
145 150 

Ser Gin Asp Val Ser Thr Ala Val Ala\ Trp 
165 V70 

Gin Ser Pro Lys Leu Leu He Tyr Trp Tf 
180 185 

Val Pro Asp Arg Phe Thr Gly Ser Gly Ser 
195 200 

Thr He Ser Ser Val Gin Ala Glu Asp Leu 
210 215 

Gin His Tyr Arg Val Pro Tyr Thr Phe Gly 
225 230 

He Lys Arg Ala Asp Gly Gly Gly Gly Cys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Glu Pro 
60 

Glu Thr 
75 

Glu Asp 

Gly Phe 

Ser Ser 

Met Thr 
140 

Ser He 
155 

Tyr Gin 
Ser Thr 
Thr 



Thr Tyr Ala Glu 



Ser Ala Ser Thr 
80 

Thr Ala Thr Tyr 
95 

Ala Asn Trp Gly 
110 

Ser Gly Ser Ser 
125 

Gin Ser Pro Lys 



Ser Cys Lys Ala 
160 

Gin Lys Pro Gly 
175 

Arg His Thr Gly 
190 

Asp Tyr Thr Leu 
205 



Ala\Leu His Tyr Cys Gin 
S,20 

Gly GlV Thr Lys Leu Glu 

235 \ 240 



(ii) MOLECULE TYPE: cDNA 
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(ix) FEATl 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.-758 

(D) OTHER INFORMATION: /product= "26-10 sFv' with 
CVterminal Gly4-Cys" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CC ATG GAA GTT CAA CTG 
Met Glu Val Gin Leu 
1 5 



._. CAG TCT GGT CCT GAA TTG GTT AAA CCT 
Ln Gin Ser Gly Pro Glu Leu Val Lys Pro 
10 L5 



GGC GCC TCT GTG CGC ATG TCC N 
Gly Ala Ser Val Arg Met Ser 
20 



AAA TCC TCT GGG TAC ATT TTC ACC 
lys Lys Ser Ser Gly Tyr He Phe Thr 



TGC 



25 



30 



GAC TTC TAC ATG AAT TGG GTT CG(\CAG TCT CAT GGT AAG TCT CTA GAC 
Asp Phe Tyr Met Asn Trp Val Arg\Gln Ser His Gly Lys Ser Leu Asp 
35 V»0 45 




TAC ATC GGG TAC ATT TCC CCA TAC T( 
Tyr He Gly Tyr He Ser Pro Tyr Sei 
50 55 



GGG GTT ACC GGC TAC AAC CAG 
Gly Val Thr Gly Tyr Asn Gin 
60 



AAG TTT AAA GGT AAG GCG ACC CTT ACT G^C GAC AAA TCT TCC TCA ACT 
Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Lys Ser Ser Ser Thr 
65 70 \ 75 ' 



\ 



GCT TAC ATG GAG CTG CGT TCT TTG ACC TCT GAG GAC TCC GCG GTA TAC 
Ala Tyr Met Glu Leu Arg Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr 
80 85 9f» 95 



TAT TGC GCG GGC TCC TCT GGT AAC AAA TGG GCC 
Tyr Cys Ala Gly Ser Ser Gly Asn Lys Trp Ala 
100 105 




lTG GAT TAT TGG GGT 
t Asp Tyr Trp Gly 
110 " 



CAT GGT GCT AGC GTT ACT GTG AGC TCC TCC GGA TCT\TCA TCT AGC GGT 
His Gly Ala Ser Val Thr Val Ser Ser Ser Gly Ser Ser Ser Ser Gly 
115 120 \ 125 

TCC AGC TCG AGT GGA TCC GAC GTC GTA ATG ACC CAG ACTv CCG CTG TCT 
Ser Ser Ser Ser Gly Ser Asp Val Val Met Thr Gin Thr \Pro Leu Ser 



130 



135 



140 



CTG CCG GTT TCT CTG GGT GAC CAG GCT TCT ATT TCT TGC CGC TCT TCC 
Leu Pro Val Ser Leu Gly Asp Gin Ala Ser He Ser Cys Argv Ser Ser 
145 150 155 \ 

CAG TCT CTG GTC CAT TCT AAT GGT AAC ACT TAC CTG AAC TGG TAC CTG 
Gin Ser Leu Val His Ser Asn Gly Asn Thr Tyr Leu Asn Trp Tyr Leu 
160 165 170 \ 175 



47 



95 



143 



191 



239 



287 



335 



383 



431 



479 



527 
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CAA AAG GCT GGT CAG TCT CCG AAG CTT CTG ATC TAC AAA GTC TCT AAC 
Gin l3vs Ala Gly Gin Ser Pro Lys Leu Leu He Tyr Lys Val Ser Asn 
180 185 190 

CGC TTC ICT GGT GTC CCG GAT CGT TTC TCT GGT TCT GGT TCT GGT ACT 
Arg Phe SAr Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr 
195 200 205 

GAC TTC ACC CTG AAG ATC TCT CGT GTC GAG GCC GAA GAC CTG GGT ATC 
Asp Phe Thr Le\ Lys He Ser Arg Val Glu Ala Glu Asp Leu Gly He 
210 \ 215 220 

TAC TTC TGC TCT Ca\aCT ACT CAT GTA CCG CCG ACT TTT GGT GGT GGC 
Tyr Phe Cys Ser Gin \hr Thr His Val Pro Pro Thr Phe Gly Gly Gly 
225 \ 230 235 



ACC AAG CTC GAG ATT AAA 
Thr Lys Leu Glu He Lys 
240 245 

CCGGGGGATC C 



IT TCC GGG GGA GGT GGA TGT TAACTGCAGC 
Ark Ser Gly Gly Gly Gly Cys 
250 



575 



623 



671 



719 



768 



779 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino aci< 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Val Gin Leu Gin Gin Ser Gly Pro Glu Leu ^al Lys Pro Cly 
1 5 10 \ 15 

Ala Ser Val Arg Met Ser Cys Lys Ser Ser Gly Tyr He ^e Thr Asp 
20 25 \0 

Phe Tyr Met Asn Trp Val Arg Gin Ser His Gly Lys Ser Leu\Asp Tyr 
35 40 45 

He Gly Tyr He Ser Pro Tyr Ser Gly Val Thr Gly Tyr Asn Gl'to Lys 
50 55 60 

Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Lys Ser Ser Ser Thr A"3 
65 70 75 

Tyr Met Glu Leu Arg Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr 
85 90 95 
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Cys Ala Gly \er Ser Gly Asn Lys Trp Ala Met Asp Tyr 
jO 105 

Gly Ala Ser ValVhr Val Ser Ser Ser Gly Ser Ser Ser 
115 \ 120 125 

Ser Ser Ser Gly Se\Asp Val Val Met Thr Gin Thr Pro 
130 \ 135 140 

Pro Val Ser Leu Gly As\ Gin Ala Ser He Ser Cys Arg 
145 150\ 155 

Ser Leu Val His Ser Asn g\v Asn Thr Tyr Leu Asn Trp 
165 \ 170 

Lys Ala Gly Gin Ser Pro Lys L\u Leu He Tyr Lys Val 
180 \ 185 

Phe Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser 
195 200 \ 205 

Phe Thr Leu Lys He Ser Arg Val Glu Via Glu Asp Leu 
210 215 \ - 220 

Phe Cys Ser Gin Thr Thr His Val Pro ProVrhr Phe Gly 
225 230 X35 

Lys Leu Glu He Lys Arg Ser Gly Gly Gly GlV Cys 
245 250 X 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 739 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



Trp Gly His 
110 

Ser Gly Ser 



Leu Ser Leu 

Ser Ser Gin 
160 

Tyr Leu Gin 
175 

Ser Asn Arg 
190 

Gly Thr Asp 
Gly He Tyr 



Gly Gly Thr 
240 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..729 . 
(D) OTHER INFORMATION: /product= "520C9 sFv polypeptide 

sequence" 
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*1) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GAG ATC 
Glu He 
1 



TTG GTG CAG TCT GGA CCT GAG CTG AAG AAG CCT GGA GAG 
Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Glu 
5 10 15 



ACA GTC AAG ATC TCC TGC AAG GCT TCT GGA TAT ACC TTC GCA AAC TAT 
Thr Val Lys Ile\Ser Cys Lys Ala Ser Gly Tyr Thr Phe Ala Asn Tyr 
20 \ 25 30 

GGA ATG AAC TGG ATG.AAG CAG GCT CCA GGA AAG GGT TTA AAG TGG ATG 
Gly Met Asn Trp MetVys Gin Ala Pro Gly Lys Gly Leu Lys Trp Met 
35 \ 40 45 

GGC TGG ATA AAC ACC TACvACT GGA CAG TCA ACA TAT GCT GAT GAC TTC 
Gly Trp He Asn Thr Tyr Thr Gly Gin Ser Thr Tyr Ala Asp Asp Phe 
50 55 60 

AAG GAA CGG TTT GCC TTC TCTNtTG GAA ACC TCT GCC ACC ACT GCC CAT 
Lys Glu Arg Phe Ala Phe Ser Leu Glu Thr Ser Ala Thr Thr Ala His 
65 70 \ 75 80 



TTG CAG ATC AAC AAC CTC AGA AAT 
Leu Gin He Asn Asn Leu Arg Asn 
85 



jKG GAC TCG GCC ACA TAT TTC TGT 
Asp Ser Ala Thr Tyr Phe Cys 
90 95 



GCA AGA CGA TTT GGG TTT GCT TAC TGG 
Ala Arg Arg Phe Gly Phe Ala Tyr Trp 
100 105 




CAA GGG ACT CTG GTC AGT 
.y Gin Gly Thr Leu Val Ser 
110 



GTC TCT GCA TCG ATA TCG AGC TCC TCC GGA^TCT TCA TCT AGC GGT TCC 
Val Ser Ala Ser He Ser Ser Ser Ser Gly Ser Ser Ser Ser Gly Ser 
115 120 \ 125 

AGC TCG AGT GGA TCC GAT ATC CAG ATG ACC CAG Vt CCA TCC TCC TTA 
Ser Ser Ser Gly Ser Asp He Gin Met Thr Gin S&r Pro Ser Ser leu 
130 135 140\ 

TCT GCC TCT CTG GGA GAA AGA GTC AGT CTC ACT TGT CGG GCA AGT CAG 
Ser Ala Ser Leu Gly Glu Arg Val Ser Leu Thr Cys ArgyAla Ser Gin 
145 150 155 . \ 160 

GAC ATT GGT AAT AGC TTA ACC TGG CTT CAG CAG GAA CCA GaV GGA ACT 
Asp He Gly Asn Ser Leu Thr Trp Leu Gin Gin Glu Pro Asp Gly Thr 
165 170 1^5 

ATT AAA CGC CTG ATC TAC GCC ACA TCC AGT TTA GAT TCT GGT GTC\CCC 
He Lys Arg Leu He Tyr Ala Thr Ser Ser Leu Asp Ser Gly Val Rro 
180 185 190 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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AAA AGG TTC AGT GGC AGt CGG TCT GGG TCA GAT TAT TCT CTC ACC ATC 
Lys Are Phe Ser Gly Ser\Arg Ser Gly Ser Asp Tyr Ser Leu Thr lie 
195 \ 200 205 

AGT AGC CTT GAG TCT GAA gVt TTT GTA GTC TAT TAC TGT CTA CAA TAT 
Ser Ser Leu Glu Ser Glu Asp Phe Val Val Tyr Tyr Cys Leu Gin Tyr 
210 215 220 

GCT ATT TTT CCG TAC ACG TTcVxJA GGG GGG ACC AAC CTG GAA ATA AAA 
Ala He Phe Pro Tyr Thr Phe Gly Gly Gly Thr Asn Leu Glu He Lys 
225 230 \ 235 240 

CGG GCT GAT TAATCTGCAG 
Arg Ala Asp 



624 



672 



720 



739 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISE CSN 

(A) LENGTH: 243 amino a^eids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Glu He Gin Leu Val Gin Ser Gly Pro Glu Le\ Lys 
1 5 10 



Thr Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr 
20 25 



Gly Met Asn Trp Met Lys Gin Ala Pro Gly Lys Gly* 
35 40 



Lys Pro Gly Glu 
15 

Phe Ala Asn Tyr 
30 

,Leu Lys Trp Met 
45 



Gly Trp He Asn Thr Tyr Thr Gly Gin Ser Thr Tyr A\a Asp Asp Phe 
50 55 60 



Lys Glu Arg Phe Ala Phe Ser Leu Glu Thr Ser Ala 
65 70 75 

Leu Gin He Asn Asn Leu Arg Asn Glu Asp Ser Ala 
85 90 

Ala Arg Arg Phe Gly Phe Ala Tyr Trp Gly Gin Gly 
100 105 

Val Ser Ala Ser lie Ser Ser Ser Ser Gly Ser Ser 
115 120 



Thr\Thr Ala His 
80 

Thr Ty^r Phe Cys 
95 



Thr Leu 
110 



VVal Ser 



Ser Ser G\y Ser 
125 
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Ser Ser Serfcly Ser Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu 
130 \ 135 

Ser Ala Ser Leu\ly Glu Arg Val Ser Leu Thr Cys Arg Ala Ser Gin 
145 \ 150 155 lbU 

Asp He Gly Asn Ser \u Thr Trp Leu Gin Gin Glu Pro Asp Gly Thr 
K i^c \ 170 1'5 



165 



lie Lys Arg Leu lie Tyr\la Thr Ser Ser Leu Asp Ser Gly Val Pro 
180 \ 185 190 

Lys Arg Phe Ser Gly Ser Arg\>er Gly Ser Asp Tyr Ser Leu Thr lie 



195 



Ser Ser Leu Glu Ser Glu Asp Phe\Val Val Tyr Tyr Cys Leu Gin Tyr 
210 215 \ 220 

Ala He Phe Pro Tyr Thr Phe Gly G^ Gly Thr Asn Leu Glu lie Lys 
225 230 \ 235 240 



Arg Ala Asp 




(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note= "Linker 1" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly GlV Ser 
1 S 10 X 1 ^ 



4 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUEtfKE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE*: amino acid 

(C) STRANE^EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:\Drotein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1.-15 \ 

(D) OTHER INFORMATIONV /note= " LINKER 2" 



(xi) SEQUENCE DESCRIPTION: SEq\d NO: 8: 



sr Gly Ser Ser Ser Ser Gly 
v 10 15 




Ser Ser Ser Ser Gly Ser Ser Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..2 
(D) OTHER INFORMATION: /note= "C-Terminal TaiK (Ser-Cys) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9; 

Ser Cys 
1 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY:, linear 
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(iN MOLECULE TYPE: protein 



(ix) FEATURE: 

(a\nAME/KEY: Protein 
(B)\OCATION: 1..5 

(D) OTHER INFORMATION: /note= "C-Terminal Tail 
UJly-Gly-Gly-Gly-Cys) " 



iSI 



;W1 




5 Cy 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Gly Gly Gly CysS 
1 5 

(2) INFORMATION FOR SEQ ID M9:ll: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino a\ids 

(B) TYPE: amino acid \ 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1. .11 x 
(D) OTHER INFORMATION: /note= "C-Te\minal Tail 

(His-His-His-His-His-His-Gly-Gly-Gly-Gly-Cys ) " 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

His His His His His His Gly Gly Gly Gly Cys 
1 5 10 



